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SCHOOL PRINCIPALS SOMETIMES SHOW SO MUCH FAVORABLE BIAS in rating 


PLAYING WITH BLOCKS ISN'T POPULAR WITH TODAY'$kindergarteners 


NONSENSE WORDS CAN BE BETTER THAN REAL WORDS inword discrimination 


0. > " It Says in the JER... 


their teachers that their judgments aren't very meaningful, Lord and Cole 
have found. In a study to see if data from personality tests is useful in pre- 
dicting success in teaching, the researchers encountered a serious obstacle 
because principals tended to describe all their teachers very favorably. 
When the principals hadto make a forced-choice method of rating, their eval- 
uations became more realistic. Predictions of teaching success based on 
personality data are more accurate than those based on conventional aca- 


data, Lord and Cole report. 


depsite the emphasis on block play in the kindergarten curriculum, accord- | 
ing to a California study by Margolin and Leton. The researchers showed 
200 Los Angeles kindergarten children 10 pairs of photographs depicting block 
play activities and other forms of kindergarten activity. “Which of these 
would you rather be doing?", the vounesters were asked. Two-thirds voted 
for non-block activities. Тһе girls were particularly ‘‘anti-block’’. 


drills aimed at improved spelling ability, Mason reports. A group of stu- 
dents who practiced detecting small differences between nonsense words did 
better on a spelling test than did students who drilled on the words them- 
selves. For developing progressive visual discrimination of words, Mason 
suggests a series of graded “попвепве”” exercises. 


LEARNING DISABILITIES DO NOT OCCUR MOST FREQUENTLY IN FIRST- BORN 


MALES. Males with sisters do not demonstrate a high incidence of learning 
disability. Males with learning difficulties do tend to-have more brothers 
than sisters. These are the conclusions of Hodges and Balow after studying 


261 Minnesota slow learners in terms of their birth positions with regard to 
siblings. 


‘CITIZEN ADVISORY COMMITTEES CAN MAKE HEALTHY CONTRIBUTIONS to the 
public schools if they avoid certain “ехсеввев and mistakes", O'Neal says 
after studying the movement in New York state. A successful lay committee, 
says O'Neal, is one which (1) pays attention to the spirit of democracy in its 
operation; (2) is appointed by the board of education after it is certain that 
advice is really wanted, operates within the framework of board policy, and 
reports back to the board; (3) is given a reasonabletime limit; (4) is composed 
of members who serve as individuals and who represent a community cross- 
Section, and (5) avoids minority reports. 


WHEN IT COMES TO PREDICTING TEACHING SUCCESS in the social studies, un- 
defined “Q” factors are more important than commonly-held criteria, Ellis 
concludes after examining the backgrounds of 44 ‘‘outstanding’’ teachers and 
26 **average or below-average” teachers in Texas. Тһе researcher could 
find no differences of statistical significance among the groups evaluated on 
the basis of patterns of preparation, quantity of preparation, or quality of 
preparation, although the general trend of the differences tended to favor the 
the outstanding teachers. 


MARRIAGE PREPARATION COURSES NOW HAVE A TOE-HOLD in a majority of 
colleges and universities, if a California study by Manheim is representative. 
The typical situation is a one-semester, three-unit course offered by either 
the sociology or home economics departments. It is not likely to bea required 
course for other than home ec majors. More women than men will be en- 
rolled. There is no generally accepted text. Films are widely usedas teach- 
ing aids. There is a little better than a 50 percent chance that the instructor 
will have majored in sociology, psychology, or home economics. 


€ 


WHETHER AMERICAN INDIANS HAVE LEADERSHIP TRAITS that differ from those 
of Anglo Americans depends on what psychological test you use, Cassel and 
Sanders found when they compared students in Arizona. Using a test that 
assesses values or philosophy related to leadership, there was no significant 
difference between 100 Anglo American high-school youths in Phoenix and 62 
Apache Indian students in Globe. However, using а test that measures the 
pattern for influencing others, the Indians tended to be more autocratic, the 
Anglo Americans more democratic. 
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MARRIAGE PREPARATION COURSES IN 
SOUTHERN CALIFORNIA COLLEGES 


HENRY L. MANHEIM 
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Introduction 


COURSES DESIGNED to prepare students for 
a successful and happy married life have appeared 
in American colleges with increasing frequency in 
recent years. Since World War II, the great num- 
bers of veterans returning to college, their in- 
Creased age, and the larger proportion of married 
Students, have given much impetus to the develop- 
ment of such courses. Further incentive comes 
from the growing college enrollments and an in- 
creasing acceptance, by administrators ‚о the view 
that the college's purpose is to prepare students not 
merely for a vocation, but also to assistthem inthe 
fulfillment of their roles as effective citizens of a 
community. Because of the relative newness of 
this type of education such courses vary con- 
Siderably in their content, approach, materials, 
goals, and effectiveness. Every semester, every 
year, sees many schools adding such courses, 
changing them, or perhaps dropping them from their 
curricula. 

The present study was undertaken to shed 
some light on some of the many problems which 
confront those whose professional interest lies in 
this area, and to attempt to supply detailed infor- 
mation about some current marriage preparation 
courses. Little specific information is available 
as to how many institutions have recognized their 
responsibilities,in the area of marriage preparation 
education, and how many have implemented this 
recognition by establishing courses designed to help 
do this. Furthermore, inthose institutions which 
do offer such courses, how are they conducted? 
That is, what materials are inuse, what teaching 
techniques, whoare the students, what are the spe- 
cific goals which the instructor attempts to attain, 
what is the instructor’s own background and train- 
ing, how successful are the courses, and so on? 


* Footnotes willbe found at the end of the article. 


In the present investigation, colleges insouth- 
ern California were studied. Not onlyare there 
many junior colleges, colleges, and universities in 
this area, but specific impetus was given to the field 
of marriage preparation education by the Californig 
Study of General Education in the Junior College. 
Thus, while not necessarily typical of the entire 
country, it is reasonable. to assume that the courses 
offered in this geographical area comprise a rep- 
resentative cross-section of such courses. 


Design of the Study 


Questionnaires were sent to 58 junior colleges, 
colleges, and universities in southern California, 
This was the total number of liberal arts or prima- 
rily teacher preparatory schools in the area, and 
included 30 public junior colleges, 2 private junior 
colleges, and 26 colleges and universities, both 
public and private. In the following discussion, the 
junior colleges are referred to as GroupI, while 
the colleges and universities comprise Group II. 
АП of Group I were coeducational institutions, and 
all but seven of Group II were. Fifty-two of the 58 
institutions (90 per cent) respondedto the question- 
naire. Those which did not were four public junior 
colleges and two denominational colleges. 

In each case the questionnaire and an explana- 
tory letter were sent to the Dean of Curriculum, or 
equivalent officer, for forwarding to the appropriate 
person. In some of the larger institutions, several 
questionnaires were sent to various division chair- 
men. Information was requested about courses 
which were “designed to prepare students for mar- 
riage and for achieving a happy andsuccessful fam- 
ily life. ’’ This was qualified by noting that the sur- 
vey was not interested in the **usual academically- 
oriented course which studies mzrriage as a social 
institution. '* 
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Results 


Table I shows the number of institutions of - 
fering marriage preparation courses. Of the Group 
I institutions, 86 per cent offer one or more 
courses, while 67 per cent of the Group II institu- 
tions fall into this category, making 77 per cent of 
the total. Over half the institutions offer just one 
course, with a few offering two or more. One in- 
stitution reported offering six such courses. 

There is great variation in the department 
under which the courses are offered, as shown by 
Table II. Since some institutions reported on more 
than one course, the totals here do not necessarily 
coincide with those for other tables. Furthermore, 
in some cases the course was offered under two or 
more departments as a joint effort. Departments 
of Sociology are responsible for the greatest num- 
ber of courses in all three categories, with Home 
Economics running a close second. In Grcup I, So- 
cial Science departments аге in the third place, 
probably because many of the junior colleges do not 
have their faculties organized into separate depart- 
ments. In Group П, probably influenced by the 
number of denominational schools represented, Re- 
ligion occupies the third step. 

Table III illustrates the fact that almost two- 
thirds of all the courses for which data are avail- 
able are three-semester-unit courses. Almost in- 
variably these meet for one hour,three timesa 
week. Twenty-nine per cent are two-unit courses, 
and in Group I there was one four-unit course, and 
one which carried no unit credit. It willalso be 
noted from this table that data are available for a 
total of 62 courses, offeredinthe 52 institutions re- 
Sponding. Inallbuta few cases these are one- 
Semester courses, 

For the most part, theseare elective courses. 
Of those Group I institutions which offer any mar- 
riage preparation courses, only 17 рег centrequire 
any of their students to take any of the courses. 
For Group II the figure is 63 per cent, and for the 
total group, 35 per cent. As will be seen from 
Table IV, it is almost entirely the Home Economics 
majors who are requiredto take these courses, and 
therefore the 63 per cent figure mentioned above 
does not include a large proportion of students. In 
only two institutions are all students required to 
take one of these courses. 

As mentioned above, allbutsevea:astitutions 
are coeducational. In no case does a coed institu- 
tion offer a course which is not open to both men 
and women. Interestingly, one of the women’s col- 
leges reportsthatits marriage course is coeduca- 
tional, with about half the class usuaiiy composed 
of the fiances of the women students. Table V 
Shows the approximate distribution of men and 
women in coed classes. In 47 per cent of the Group 
Iclasses, for example, the men and women are 
roughly evenly represented. In Group II, on the 
other hand, it can beseenthatthe classes are pre- 
dominantly composed of women. This is, no doubt, 


affected by the nine Group II institutions which re- 
quire their Home Economics majors to take these 
courses. Also it seems reasonable to believe that 
the men attending collegesand universities are 
more concerned with vocationalthan with marriage 
preparation. For the total number of institutions, 
as one might surmise, more women than men tend 
to take such courses. However, the number of ques- 
tionnaires which did not include this information is 
large enough so that these conclusions might be 
materially affected. 

There is a great variety of textbooks in use in 
the different institutions. Only 44 of the 62 courses 
were reported as having a required text. The other 
18 either do not require any text or the information 
was not given on the questionnaire. Nineteen dif- 
ferent texts were mentionedas being used, but only 
two had more than three adoptions. 

Table VI indicates the number of courses in 
which audio-visual aids are used. The base of the 
percentages shown is the number of courses for 
which data are available. Since more than one type 
of aid is used in many courses, the percentages do 
not total 100. As would be expected, films are the 
most widely used, and a little more surprisingly, 
are used more frequently in Group II than in Group 
г 

The teaching approaches and techniques used. 
are tabulated in Table VII. None of GroupIreported 
using the lecture method alone, although there were 
a large number which did not give any information 
on this question. A combination of the lecture and 
discussion methods was by far the most popular, 
being used in almost two-thirds of Group II, and 
half of the total number of institutions. 

The Group I institutions tend to be smaller and 
their faculties tend to be less specialized than in 
Group II. Instructors are frequently assigned 
courses in more than one field. This undoubtedly is 
at least part of the reasonfor the great diversi- 
fication of instructors' majors shown in Table VIII. 
However, 58 per cent of the major fields reported 
in Group I are either psychology, sociology or so- 
cial science. In Group П, it will be noted, almost 
two-thirds of the majors are composed of sociology, 
religion, or home economics. For the total group, 
sociology and psychology comprise 44 per cent with 
the balance ranging from music to zoology. Some 
instructors reported more than one major field, and 
in such cases they were all tabulated, since the pur- 
pose of the table is to give an indication of how well 
prepared the instructor is to teach this course. 

The limitations imposed by a brief mail ques- 
tionnaire such as this one were encountered most 
acutely in summarizing the responses on course 
goals and objectives. Tocompletely presentthe gist 
of the various responses would involve at least a 
verbatim quotation of each. A few of the general 
trends may be discussed here, however. In Group 
Ithe wording of the responses indicates thatthe 
thinking seems to run toward the actual preparation 
for marriage, including mate selection. There is а 


TABLE II 
2 DEPARTMENTS UNDER WHICH COURSES ARE OFFERED 


Group I Group II Total 
Department N percent М per cent N per cent 


TABLE I 
NUMBER OF MARRIAGE PREPARATION COURSES 956 еҢ Studies 

(AS DEFINED) OFFERED BY INSTITUTIONS Orientation 1 3 1 3 2 3 
Group I Group II Total Health Education 3 9 3 5 
Бшк. с N per cent N per cent М per cent Home Economies 7 23 10 30 17 26 
Human Relations 1 3 1 2 
0 $ + 8 5 E 5 Psychology 1 3 3 9 4 6 
1 20 72 10 42 30 57 Religion 5 T Я А 
: = T : 8 Е ч Science 3 10 3 5 
i: 5 1 : ч Е > Social Science 5 17 5 8 
di guo tee aL e Sociology 95:31 119€ 94-205 31 
Total 28 100 24 100 52 100 ЗВЕНЕ 3 | 7 
Not Stated S pend E жара е 
Total* 30 100 33 100 63 100 


* Note: Since some institutions offer more than one 
course, these totals may differ from totals in other 
tables. Also, some institutions reported courses 
offered jointly under more than one department. 
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TABLE III 


NUMBER OF SEMESTER UNITS OF COURSES 


TABLE IV 


NUMBER OF INSTITUTIONS WHICH REQUIRE COURSE 


OF VARIOUS CATEGORIES OF STUDENTS 


Student Category 


GroupI 


Group II 


Total 


NM UC < CHER I Group ii Total 
of Units N per cent М per cent М per cent 
0 1 3 1 2 
1 $ 3 2 6 3 5 
2 10 34 8 25 .18 . 29 
3 16 54 22 69 38 60 
4 1 3 1 2 
Not Stated ue aris Um mte oer Lem eas d^ 
Total* 30 100 32 100 62 100 


* See note, Table II 


АП Students 
Freshmen 
Juniors 
Seniors 

Upper Division 


Home Economics 
Majors 


Nursery School 
Training Majors 


Secondary Teacher 
Training Majors 


Sociology Majors 


Not Stated 
Total 


el - 


11 


S + 
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TABLE V 


NUMBER OF COED COURSES WITH VARIOUS 
PERCENTAGES OF MEN ENROLLED 


Darien Eu I Group II Total 

of Men N percent М percent М per cent 
0- 35 I9 099» 740 38 17 30 

36- 65 14:-4T.- ТО 38 24 43 

66-100 4 18 1 4 5 9 

Not Stated шашы A 222041102218 

Total 30 100 26 100 56 100 

TABLE УІ 


NUMBER OF COURSES IN WHICH AUDIO-VISUAL 
MATERIALS ARE USED 


Group I Group II Total 


Materials N per cent* N per cent* N per cent* 
Movies 197 “63 26 81 45 73 
Slides and 

Film-strips 1 3 2 6 3 5 
Others (models, 

charts, etc.) 7772129 2 6 97715 
None ог No 

Answer 7 ДЗ 6 19. 13 21 


*Note: The base of these percentages is the total 
"number of courses for which data are available. 
Since many courses report the use of more than 
one type of audio-visual material, the sum of 


these percentages is greater than 100. 
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Stress on personal adjustment, formation of atti- 
tudes, and the more functionalaspects of marriage. 
In Group II, on the other hand, there is asomewhat 
greater stress laid on marriage as a social institu- 
tion and the various specialized home economics 
goals, such as training inchildcare, home manage- 
ment, and home furnishing. This, of course, re- 
flects the comparatively large number of Group II 
institutions which require these courses of their 
home economics majors, InGroup I, however, there 
is also а considerable amount of emphasis placed 
upon the same goals as mentioned for Group I. 

Despite the fact that similar limitations apply 
to the portion of the questionnaire in which the re- 
spondents were asked what theyconsidered the most 
valuable part of the course, there was much greater 
agreement among all the answers. It was reported 
almost invariably that the more practical parts of 
the course were considered most valuable by both 
Students and instructors. This was stated in such 
terms as sexual aspects, courtshipand mate selec- 
tion, student self-analysis, personal adjustment, 
and advisory facilities. This was particularly true 
of Group II, indicating that perhaps there is a sig- 
nificant discrepancy between the stated objectives 
of the courses offered in the colleges and universities, 
and the portions which аге of most interestandval- 
ue to the students. 


Conclusions 


Certain conclusions and generalizations may 
be drawn from these data regarding southern Cali- 
fornia liberal arts institutions of higher learning. 

In the first place, the unusually and unexpect- 
edly high rate of returns of the questionnaire may be 
interpreted as an indication of interest in marriage 
preparation courses on the part of the administra- 
tors. This, in itself, is an encouraging note. Fur- 
thermore, well over three-fourths of the institutions 
have actually attempted to meet these needs of their 
Students by setting up courses which are designed to 
do this. Thisis particularly true of the junior col- 
lege, as compared to the colleges and universities, 
with seven out of eight junior colleges offering one 
or more such courses. This is probably due to the 
fact that the junior college administrative organiza- 
tion is usually more flexible. Furthermore, since 
the junior college program is, by its very nature, 
aimed at a widér cross section of the population, 
while the four-year college and university provide 
professional training, it is not surprising to see this 
trend. 

There is a rather noticeable difference of opin- 
ion as to the department under whose aegis this type 
of course should be presented. Between them, so- 
ciology and home economics sponsor over half, with 
the balance scattered over several other depart- 
ments. One might hazard the opinion that the courses 
presented by home economics departments tend to be 
concerned more with the specialized and practical 
techniques concerned in home managementand child 


care, while the sociological approach would tend t 
Stress the individual personality and personaladjust 
ment to marriage. 

On the administrative level, it will be seen tha 
with only one exception, these courses аге offeret 
as regular parts of the curriculum, for which th 
Student receives unit credit. However, they ar: 
Still regarded as somewhat peripheralin nature, по 
being required of many students other than home e 
conomics majors. Undoubtedly there is a trend to 
ward requiring an increasing number of students t 
enroll for such courses. Because of the greate 
tendency of men to start their vocational specializa 
tion in the upper division, and because of the greate 
emphasis on vocational training for men than fo1 
women, as well as the fact that many home econom 
ies majors are required to take marriage prepara 
tion courses, it is not surprising to see that in all 
courses there is a smaller enrollment of men thar 
women. 

In the classroom, onlytwotexts һауе тог 
than three adoptions, and in terms of the total num 
ber of courses for which data are available, noteven 
the most widely used text has a good hold on the 
market. In about half of the total number of courses. 
a text is required which was written for this type of 
course, while the rest of the courses use other, 
more general texts, or none at all. 

The value of films as teaching aids seems tc 
be widely recognized, although their widespread use 
тау be interpreted as evidence of dissatisfactioi 
with the adequacy of the straight lecture method 
Nevertheless, not only films, butother aids and 
techniques such as recordings and discussions are 
coming more and more into their own. Тһе newer 
techniques such as the use of buzzsessions, panels, 
and resource persons, requiring more time and 
preparation on the part of the instructor as they do, 
have yet to be fully accepted. 

Perhaps the factor mostindicative of the new- 
ness of this portion of the curriculum is the widely 
varied background and preparation of those who are 
doing the teaching in it. Sociology, psychology, and 
home economics departments wouldseem to have the 
soundest claim on this field, and yetonlya little over 
half of the majors which were reported were in these 
three departments. This wide variety of background 
that is presently seen allows the instructors to bring 
a wealth of material to the classroom whieh is based 
upon wide personal experience, but itis probably not 
amiss to express regret that no more have had ex- 
tensive academic as well as practical tra ining in 
marriage and the family. The trend is, no doubt, in 
the direction of such increased training, although the 
facilities for this are still quite sparse. 

The stated goals and objectives of these courses 
tend to be more theoretically and academically stated 
in the colleges and universities than in the junior col- 
leges, as would be expected. However, it is inter- 
esting to note that at bothlevels the most successful 
and important portions of the courses seem to be the 
functional and practical rather than the theoretical. 
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By way of summary опе might lookatthe typ- 
ical course. It is likely to be a one-semester, 
three-unit course, offered by either the sociology 
or home economics department. It is not likely to 
be a required course for other thanhome economics 
majors, is a coed course, but probably has more 
women than men enrolled. Films are quite regular- 
ly used as teaching aids, especially the McGraw- 
Hill series. Approximately half of the instructors 
use the lecture method along with discussions, and 
perhaps other techniques such as group work, pan- 
els, or outside speakers, in addition. There is al- 
so a little better than one chance in two that the 
student will have an instructor who received ade- 
quate training inthis field by majoring in sociology, 
psychology, or home economics. 


By and large, institutions of higher learning 
in southern California have recognized that they 
have responsibilities to help their students prepare 
for marriage. Furthermore, they have accepted 
these responsibilities within the framework of their 
curricula. Insofar as this geographicalarea is rep- 
resentative of the rest of the country, here is an 
up-to-date picture of the situation throughout the 
nation’s colleges. Just how effective these efforts 
at marriage preparation have been will not be known 
until some form of follow-up study is made of those 
Students who have availed themselves of the oppor- 
tunity to take these courses. 


1. 


FOOTNOTES 


There are at least two previous studies related 
to the present one. In 1948-1949 Bowman con- 
ducted a similar study, but of a nation-wide 
sample. See Henry A.Bowman, ‘‘Marriage Ed- 

ucation in the Colleges, ’’ Journal of Social Hy- 
giene, 35:407-17, December, 1949. A region- 
al study of eleven southern states, using Bow- 
man’s questionnaire, appeared four years later; 
“Education for Marriage and Family Relations 
in Southern Colleges," Report by members of 
the Committee on the Teaching of Sociology of 
the Southern Sociological Society, Social 
Forces, 32:61-65, October, 1953. Subsequent 
to the collection of data for the present study, 
Landis published the results of his study of an- 
other national sample. See: Judson T. Landis, 
“Тһе Teaching of Marriage and Family Courses 
in Colleges, " Marriage and Family Living, 

21:36-40, February, 1959. 

This study, conducted in 1950 and 1951 at the 
University of California at Los Angeles, was 
sponsored jointly by the California State Junior 
College Association, the California State De- 
partment of Education, the UCLA School of Ed- 
ucation, and the American Council on Education. 

It is reported on in B. Lamar Johnson, General 


Education in Action (Washington: American 


Council on Education, 1952). Pp. xxvi + 409. 
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INTEREST OF KINDERGARTEN 


PUPILS IN BLOCK PLAY 


E. B. MARGOLIN and D. A. LETON 
University of California, Los Angeles 


BLOCK PLAY has been included among kinder- 
garten activities since Froebel first introduced it in 
1837, and it has now assumed a major role in the 
kindergarten curriculum. In succeeding years the 
blocks as originally introduced have undergone 
many changes, in shapes, insizes, and in their 
methods of use. At the end ofthe nineteenth centu- 
ry (1898) Hill designed blocks which were much 
larger and heavier than Froebel's. These could be 
used for construction activities because of the in- 
terlocking ends.. 

Around the 1940's, Pratt designed another var- 
iety of blocks within a geometric unit. These blocks 
would fit evenly when they were stacked or piledto- 
gether. There were squares, triangles, cylinders, 
and rectangles of long and short lengths, similar to 
those which are in common use atthe present time. 
There is now a great variety of blocks, some of 
hardwood, lightweight and hollow, others of com- 
pressed synthetics of various shapes and sizes. 

Johnson (1945) recognizedthe merit in blocks as 
an educational and expressivetool. Besides study- 
ing block activities, in which the specific develop- 
mental stages of the children were noted, she also 
recognized the use of blocks as an art medium, 
Today there are some teachers and curriculum ex- 
perts who perceive block play as an art medium, 
while others regard it as an important tool for con- 
Struction in dramatic play. There are some who 
feel that blocks should be used as an aesthetic me- 
dium, and others who recommend them as tangible 
forms of construction which may be evaluated by the 
teacher. It is quite obvious, then, that the blocks 
become different media in the hands of different 
teachers and pupils. 


The Problem 


The purpose of this study is to investigate child- 
rén’s interest in block play in the kindergarten 
where the situation requires total class participa- 
tion. It is concerned with the extent to which child- 
ren choose or prefer block play among their kinder- 
garten activities, indoors, and also the variation 
which children show in these preferences. 


The previous research on the use of block play 
falls into three categories: 1) studies of children's 
interests, preferences, or choices of play materials; 
2) studies of developmental stages in the useof 
blocks; and 3) studies of the values of block play in 
personality development. Although only those stud- 
ies pertaining to children’s interest, choices, and 
preferences of blocks are directly related to the 
present study, the research on developmental stages 
and personality development is also reported since 
they may provide explanations of children’s choices. 

The summary of the findings is as follows; 1) 
Blocks were among the least preferred play mater- 
ials chosen by boys and girls (Vance and McCall 
(20) ); 2) Blocks were among the most popular ma- 
terials chosen by boys (Farwell (6), VanAlstyne 
(19), and Besch (3) ); 3) Sex differences were evi- 
dent in the choices and in the amount of time spent 
with specific play materials (Bridges (4), Farwell 
(6), Van Alstyne (19), Farrell (5), Beach (3)); 4) 
Age, experimental maturity, and background deter- 
mine the way a child uses play materials (Guanella 
(9), Johnson (12), Updegraff and Herbst (18), Hartley, 
Frank and Goldenson (10), Amen and Rennison (1); 
Lebo, 1955; and Besch (3) );and 5) The block play 
of children must necessarily pass through specific 
developmental stages before understanding and a- 
bility in the activity can take place (Guanella (9), 
and Johnson (12) ). 


Subjects and Method 


The data were secured from 200 kindergarten 
children enrolled in twenty kindergartens of the Los 
Angeles City Schools. The kindergartens varied as 
to location, classroom grouping, teacher personal- 
ities, and instruction. The school communities 
ranged from low to high in levels of socio-econom- 
ic backgrounds. The curricula of the kindergartens 
were recommended by courses of study and by cur- 
riculum guides of the California State Depar tm ent 
of Education and the Los Angeles City Schools. 

The pupils included in the study ranged in age 
from four years-eleven months to five years-ten 
months. Five boys and five girls were selected 
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from each kindergarten using systematic sampling 
procedures. They had attended school for about six 
weeks before the study began, which presumably 
gave them time to form attitudes about school ex- 
periences. 

The stimulus material consisted of ten pairs of 
photographs. One photo of each pair portraye d 
children engaged in block play activities and the 
other portrayed children in other kindergarten ac- 
tivities. The other activities, referred to as non- 
block activities in this article, included painting, 
drawing, clay modeling, looking at books, listening 
to records, putting puzzles together, finger-paint- 
ing, playing with rhythm instruments, playing in 
the playhouse, and cutting and pasting collage. 

The ten pairs of photographs were presented to 
each child and after viewing each pair he was asked 
the question, ««Which of these would you rather be 
doing?’ The pairing of the photographs was rotat- 
ed so that there were ten different pairings. The 
pairing was changed after ten boys and ten girls 
had responded to the pairing. 


Analysis of the Data 


The responses which the children made with re- 
spect to choice of photos were analyzed for activity 
preferences and for sex differences. The tabulation 
of these data is shown in Table I. In the compari- 
son of block and non-block preferences, the largest 
percentage of children in a single category were 
represented in the 19 percent who showeda prefer- 
ence for eight non-block photos andtwo in the block 
activity. The nextlargest group was 15.5 percent 
which preferred seven non-block photos and only 
three block photos. The nextlargest group, 14 
percent of the children, showed a preference for 
nine non-block and only one block photo from all ten 
pairs. There were 9.5 percent of the children who 
preferred all ten non-block photos. These are sig- 
nificant figures when one considers the assumption 
that blocks were highly enjoyed by all children. 

The boys' choices were distributed more uni- 
formly over the range of choices. Seventeen boys 
preferred seven non-block and three block photos. 
Thirty-seven percent of the boys chose photos of 
non-block activities over the block activities. 

The greatest number of girls, twenty-six, chose 
eight non-block and two block photos. The next 
largest group, twenty girls, preferred nine non- 
block and one block photo. Nineteen chose all non- 
block photos. АП of theselarger figures represent 
significant preference among girls for the non-block 
photos. It has been found inthis study, asin others 
in the past, that boys show a greater preference 
for block activities than do girls. 

Table II shows the distribution of responses for 
boys and girls to each of the paired photos. The 
homogeneity of the two frequency distributions was 
tested by the use of chi square. For a X? value of 
65.24, the probability the two distributions would 
have arisen by chance isless than .001. It is con- 


cluded, then, that the two distributions are statis- 
tically different. The cells which contributed most 
to this difference were the latter three cells, in 
which the girls established predominant preference 
for non-block photos. 

The most popular was non-block 7, the rhythm 
instrument activity, which received 146 votes or 7.8 
percent of all possible choices for any of the non- 
block photos. Non-block 3, which showed children 
listening to records, wasnextin popularity with 140 
votes. Painting, non-block 9, received 138 votes, 
which is quite close to number 3. Theleast popular 
of all the non-block activities was the colored chalk 
activity, non-block 6, which received 114 votes. It 
was interesting to note that only one of the block 
photos ranked within the first eleven. (Table IV) 

Because the pairing of photographs was rotated 
it is possible to analyze the difference in magnitude 
of choices. The t test was utilized to exa mine the 
difference between the twodistributions. The mean 
of the distribution of differences was calculated 
(D-ED/N- 55.8). With at value of 2.44 and 9 
degrees of freedom, it is concluded that there is a 
significant difference between the observed distri- 
butions at the .05 level. The statistical conclusion 
is that the non-block photos were significantly fa- 
vored over the block photos. Тһе educational con- 
clusion is that the various non-block activities are 
significantly favored over block activities. 


Analysis of Sex Differences 


There were a total of 721 outof 2, 000 responses 
which showed a preference for blocks and 1,279 of 
the 2, 000 responses which showed a preference for 
non-blocks. Тһе percentage of children preferring 
blocks was 36.1, while 63. 9 percent preferred non- 
blocks. 

Using a fourfold contingency table, the frequency 
for boys and girls favoring blocks was analyzed by 
the use of chi square. The data for this anal ysis 
are shown in Table Ш. With a chi square value of 
300.14, it is found that the obser ved preferences 
differ significantly from chance distribution 
(p <.001). It is concluded that the classification of 
responses is not homogeneous. The proportion of 
girls who chose non-block photographs established 
a significant preference for non-block activities. 
Boys' preference for either block or non-block ac- 
tivities was not established. 


Summary and Conclusions 


This study investigatedthe preference of kinder- 
garten children for block andnon-block activities in 
the curriculum through their choices on paired pho- 
tos. The findings were: 1) there was a significant 
sex difference in preferences for block andnon- 
block activities; 2) non-block activities ranked sig- 
nificantly higher in preference than block activities; 
and 3) the classification of boys' and girls' respon- 
ses was not homogeneous. 
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TABLEI 


FREQUENCIES OF BOYS' AND GIRLS' PREFERENCES FOR 
BLOCK AND NON-BLOCK PHOTOGRAPHS 


Choices of Photos* 


Block Non-Block Boys Girls Totals Percentages 
10 0 11 2 13 6.5 
9 1 2 0 2 1.0 
8 2 2 1 3 1.5 
7 3 10 1 11 5.5 
6 4 16 2 18 9.0 
5 5 11 7 18 9.0 
4 6 11 8 19 9.5 
3 d 17 14 31 15.5 
2 8 12 26 38 19.0 
1 9 8 20 28 14.0 
0 10 0 19 19 9:5 


* A complete preference for photographs showing block activities over 
non-block activities om the ten choices is indicated as 10-0. Other 
combinations of choices are indicated in the same manner, 9-1, 8-2, 
and so on. 
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TABLE II 


COMPUTATIONS OF CHI-SQUARE TEST FOR HOMOGENEITY 
OF TWO FREQUENCY DISTRIBUTION 


Е —————————————————— 


Combination of Boys Girls aP 
Photo Choices (a (a) 

10-0 11 2 .81 
9-1 2 0 .0 
8-2 2 1 .33 
LO 10 1 .90 
6-4 16 2 .22 
5.-5 11 7 2212 
4-6 11 8 3.36 
Siac? 17 14 6.32 
2/8 12 26 17.79 
1-9 8 20 14.29 
0-10 0 19 19.00 
Total 100 100 65.24 

p=.50  díf.-11-1-210 xX” = 65.24 p<.001 

TABLE III 


CLASSIFICATION OF RESPONSES ON PHOTOGRAPH PREFERENCES 


nn, EEE 


Favor Blocks Favor Non-Blocks Total Responses 
Boys 493 507 1000 
Girls 228 112 1000 
Total 721 1279 2000 


жаралы лы ы шы алық лл eO мал ug 22)! сени олан ЖЕ voies ee ou dr Veo тарса 
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TABLE IV 


RANKING OF PHOTOGRAPHS IN ORDER OF POPULARITY 


Photo Number* Content Frequency Percentage 
МВТ Rhythm Instruments 146 rie) 
N.B. 3 Records 140 7.0 
N.B. 9 Painting 138 6.9 
N.B. 1 Clay 134 6.7 
N.B. 4 Puzzles 130 6.5 
N. B. 10 Collage 124 6.2 

Бе Floor Blocks 119 5.95 
N.B. 2 Books 118 5.9 
N.B. 5 Finger-painting 118 5.9 
N.B. 8 Playhouse 117 5.85 
N.B. 6 Colored chalk 114 5.7 

B. 5 Floor Blocks and Accessories 92 4.6 

B. 8 Floor Blocks and Accessories 78 3.9 

B. 3 Floor Blocks and Accessories 77 3.85 

ү: Ж Floor Blocks and 1 Accessory 62 3.1 

Bai Floor Blocks 60 3.0 

B. 2 Floor Blocks 59 2.95 

B. 6 Floor Blocks 59 2.95 

B. 10 Floor Blocks 58 2.9 

Bi Floor Blocks 57 2.85 


2000 100.00 


_— a LA М-ы лына ey es 
* N. B. and B. refer to the non-block and block contents of the photographs respectively. 
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The inclusion of block activities for all children 
in the kindergarten is based on the assumption that 
they learn from and enjoy these activities. The 
findings of this study make it necessary to questicn 
seriously this assumption. The unequal preference 
among boys and girls, and the fact that block acti- 
vities are ranked lower, tend to weaken this as- 
sumption. 

It seems evident from the data in this study and 
from findings in research that children do not pre- 
ter blocks as much as has been thought. There 
:ould be many reasons for this. 1) Perhaps their 
ise served greater purpose in former years when 
children lacked the variety of play materials now 
ivailable, or when their play was restricted. 2) 
The blocks may not have unique qualities to differ- 
ointiate them from their pre-school toys. 3) They 
nay lack instructional value because of limitations 
n features and concepts which a pupil might acquire 
rom them. 4)In view of the popular use and pre- 
scription of block play in the curriculum, an over- 
imphasis may have been placed on the importance 
Xf blocks and their contribution to the physical and 
nental growth of children. 5) Some children may 
lislike being part of a large group in which 25 or 30 
:'hildren participate simultaneously in block play. 
) It is plausible that any activity loses its interest 
/alue when it is singled out exclusively and used 
nore frequently than others. 7) It is also possible 
hat the children in this study were in different 
stages of development such as tho se recognized by 
Вайеу, Guanella and Johnson, and that this differ- 
nce may have accounted for the children’s variation 
n choices. 

From the standpoint of modern educational prac- 
ices it may be appropriate to re-evaluate the acti- 
rities in the kindergarten іп terms of their purposes. 
А curriculum which affords a variety of activities 
S probably superior to one which is based on a r e- 
stricted choice of activities at any given time. This 
tind of planning would more nearly meet the indi- 
'idual personality needs, of which educators are 
'enerally aware. 
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A COMPARATIVE ANALYSIS OF SCORES FROM 
TWO LEADERSHIP TESTS FOR APACHE 


INDIAN AND ANGLO AMERICAN YOUTH 


RUSSELL N. CASSEL 
Lompoc Unified Schools, California 
RICHARD А. SANDERS 
University of Arkansas 


THIS STUDY was concerned with comparing the 
Scores on two tests related to leadership for 72 A- 
pache Indian youth of mixed sex with similar test 
scores for 100 Anglo American youth of mixed sex. 
Also, comparisons were made onthe two leadership 
test scores between male and female Apache youth, 
and between the test scores and certain other data 
available. It is a portion of alarger study directed 
at the investigation of dynamics in volved in an ef- 
fective leadership function, and with the develop- 
ment of psychological tests for assessing such dy- 
namics. 


Tests Utilized 


The Leadership Q-sort Test (LQT) by R. Cassel, 
published by Psychometric Affiliates, Chicago, Il- 
linois, was utilized to assessthe individual's values 
ог personal philosophy in the leadership area. This 
test uses Q-methodology and provides a Pearsonr 
between the ratings which an indi vidual assigns to 
the 60 items, in terms of howimportant he believes 
the separate items are for the performance of an 
effective leadership role, and the mean ratings ob- 
tained for the items for various normative groups. 
Each of the items represents a critical character- 
istic of an effective leader, and was selected from 
more than 3, 000 such items contributed by 400 Uni- 
ted States Air Force (USAF) pr eflight cadets indi- 
cating the critical qualifications of the most out- 
standing leader of their acquaintance or experience. 
These 3,000 characteristics of an effective leader 
were arranged into six categories of related items, 
and determine the six part scores on the LQT: 1) 
personal integrity, 2) consideration for others, 3) 
mental health, 4) technical information, 5) decision 
making, and 6) teaching and communication. 

. Тһе Leadership Ability Evaluation (L AE) by В. 
Cassel and E. Stancik, published by Western Psy- 
chological Services, Los Angeles, California, was 
utilized to assess the characteristicleadership de- 
cision pattern of the individual. The 50 items on 
the LAE are about equally divided inthe following 
areas of life drama: home andfamily, school, work, 


recreation, and community. For each test item 
there are four multiple choice type ans wers, rep- 
resenting the four part scoresonthetest. The per- 
son taking the test plays the role of aleader, and 
the particular choice he makes determines the pat- 
tern which he characteristically uses to influence 
other persons or groups. Thefour part scores are 
as follows: 1) autocratic aggressive—a leader cen- 
tered decision pattern, 2) autocratic submissive — 
an expert or resource person centered decision pat- 
tern, 3) democratic or oooperative—a parliamenta- 
ry procedure centereddecision pattern, and 4) lais- 
sez faire—an individual or group member centered 
decision pattern. The total score onthe LAE is a 
composite based on a weighting of part scores; and 
with such weights as to discern optimally between 
groups of outstanding recognized leaders and non- 
select individuals. 


Groupa Utilized 


The experimental group consisted of all of the 
Apache Indian youth presently attending the Globe 
Public Schools in Arizona, and most of them lived 
on the San Carlos Indian Reservation. The group 
was made up of 33 males and 29 females, and 28 
were attending high school; while 34 were attending 
the junior high school. The school bus brought 
these youth to Globe each day from the reservation 
which was 26 miles away. Generally, their school- 
ing until the seventh grade was accomplished on the 
reservation, and when they entered the Globe pub- 
lic schools, they experienced for the first time con- 
tinuous interaction with Anglo American youth. The 
customs and beliefs of the Apache elders was the 
rule on the San Carlos Reservation, and itis pre- 
sumed that the two leadership test profiles will de- 
pict Apache leadership notions. They rangedin age 
from 13 to 19 years, with a mean (M) age of 14. 95 
and a standard deviation (SD) of 2.36 years. Their 
I.Q.'s, generally from an Otis group test, ranged 
from 55 to 111 (only one above 100), with a M of 77 
and а SD of 11.49. It is the impression of the Globe 
School instructional personnel that the true 1.0.78 
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TABLEI 


INTER-CORRELATION MATRIX AND DISCRIMINANT FUNCTION FOR THE *SIX PART SCORES ON THE 
LQT BETWEEN 86 ANGLO AMERICAN AND 62 APACHE INDIAN YOUTH 


De 
m 
ti 
=I UUU 8 
Multiple Correlation Regression 

The LQT, Part Scores 7%: Х, X, Х% Х| “Ату 2гу 3ry Apache Indian Youth Anglo American Youth Total Group 8 
E 
1. Consideration for Others ** — X; 1000 -198 -164 -193 -121 (-121) 000 016 M 69.97 71.83 71.05 о 
SD 6.89 7.94 7.57 "n 
Ei 
2. Technical Information X,  -198 1000 -052 -096 104 104 (081) 000 M 62. 03 60. 42 61.09 [v] 
SD 6.74 8.13 7.57 С 
3. Teaching and Communication X, -164 -052 1000 -052 078 078 058 (062 M 30. 47 29.83 30.10 5 
SD 4.29 3.70 3.97 o 
4. Decision Making X5 -193 -096 -052 1000 -035 -035 012 020 M 30.12 29.72 29.89 = 

sD 4.74 4.97 4.88 
a 
5. Mental Health Xg  -158 -491 -022 -035 -041 -041 -060 -020 M 62.85 63. 45 $ 63.20 ist 
sD 6. 84 7.49 7.23 Ei 
6. Personal Integrity* -365 -190 -140 -218 M 105.05 105. 48 105.30 Я 
5р 7.73 7.58 7.67 ш 

* Sixth part excluded because of experimental dependency R^ 014641 021202 025046 R Dias 088 + . 083 
** Decimal point omitted ***R^ 007938 007794 рына 


*** Corrected for shrinkage 


TABLE II 


INTER-CORRELATION MATRIX AND DISCRIMINANT FUNCTION FOR THE *FOUR PART SCORES ON THE 
LAE BETWEEN 86 ANGLO AMERICAN AND 62 APACHE INDIAN YOUTH 


Multiple Correlation Regression 
The LAE, Part Scores X. X, Ха XQ Ігу 2ry Sry Apache Indian Youth Anglo American Youth Total Group 

Se a eS Ны eS Sc ES сатса... s TE EDU. E cea 

1. Part IV-Laissez Faire Xa %1000 -520 -331 257 257 -028 003 M 11.12 8. 66 9.97 
SD 4.43 4.94 4.62 

2. Part III-Democratic Хз -520 1000 -333 -549 (-549) 000 (031) м 17.67 25.66 21.74 
SD 5.33 1.04 6.89 

3. Part II-Autocratic Submissive X, -331 -333 1000 276 276 (093) 000 M 12.42 10.31 11.34 
SD 3.61 3.73 3.83 

4. Part I-Autocratic Aggressive* M 8.99 5.32 7.03 
SD 3.88 2.91 3.50 

2 
* Fourth part eliminated because of experimental dependency R, 301401 310050 311011 pois. .547 + . 058 


** Decimal points omitted жөні 296479 300263 296246 
*** Corrected for shrinkage 
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of these Apache youth were considerably higher, 
and that the referenced Otis sc ores depicted a lit- 
erary and cultural handicap. 

The control group was made up of 100 ninth grade 
students (50 male and 50 female) fÉrom two of the 
Phoenix high schools in Arizona, who were equated 
by sex for I.Q. and age. Their age ranged from 13 
to 16 years with a M or 14.70 and a SD of 0.71 for 
boys, and a М of 14.33 with a SD of 0.45 for the 
girls. In terms of intelligence, as measured by the 
California Test of Mental Maturity, theI.Q.'s rang- 
ed from 68 to 122, with a M for boys of 102-97 and 
a SD of 7.66, and with а M for girls of 103.03 and 
a SD of 7.15. 


Findings 


In terms of leadership values, and as measured 
by scores on the ГОТ, there were nostatistically 
significant differences between the Apache and Anglo 
American youth. See Table I. A discriminant func- 
tion, as described by Horst (4), which incorporates 
a multiple point bi-serial correlation and regression 
procedure, was accomplished using the six part 
Scores, with only the first five part scores being 
included so as to eliminate the effect of excess ex- 
perimental dependency, between the Apache Indian 
and Anglo American youth (Table 1). A multiple 
point bi-serial R of .088--. 083 was obtained. Simi- 
larly, a discriminant function was utilized to com- 
pare Apache sex differences, and no statistically 
significant differences were obtained (R - .000). 

In terms of characteristic leader ship decision 
patterns, and as measured by the L AE, a statistic- 
ally significant difference was obtained between the 
Apache and Anglo youth, as illustrated in Table II. 
The Horst discriminant function yielded a multiple 
point bi-serial R of .547 + . 058. In terms of part 
test scores, this difference may be further de- 
Scribed as : 1) Apache youth are morelaissez faire, 
more autocratic aggressive, and more autocratic 
submissive than the Anglo youth; while 2) Anglo 
youth are more democratic and cooperative than 
their Apache peers. There were no statistically 
significant sex differences among the Apache youth 
in terms of the characteristic leadership decision 
patterns (R = .203 + . 114). 

Scores from both leadership tests for the Apache 
youth were correlated with thefollowingdataby use 
of a Pearson r, and as indicated in Table III: 1) age 
to nearest six months, 2) I. Q., 3) Stanford A- 
chievement Test scores, and 4) LQT vs. LAE total 
Scores. In general, there was a significant rela- 
tionship between the age of the youth and both of the 
leadership test scores, with the older youth making 
the better scores. Similarly, the youth with the 
higher I. Q.'s made the better test scores on both 
leadership tests. Apache youth with the better a- 
chievement test total scores made the better scores 
on the LAE, but there appeared to be no difference 


onthe LQT. Apache youth who did well on one 
leadership test tended to do well on the other lead- 
ership test (significant at the 5 percentlevel of con- 
fidence). : 


Summary 


This study was concerned with making a compar- 
ison on scores for two leadership tests between 100 
Anglo American youth in two high schools in Phoe- 
nix, Arizona and 62 Apache Indian youth in the sec- 
ondary schools at Globe, Arizona. The Apache 
youth continue to live on the San Carlos Indian re- 
servation where they were born and reared. 

The two leadership tests utilized were: 1) The 
Leadership Q-sort Test by R. Cassel, published by 
Psychometric Affiliates, and 2) The Leadership A- 
bility Evaluation by R. Cassel and E. Stancik, pub- 
lished by Western Psychological Services. The 
first test is concerned with assessing one's values 
or philosophy related to leadership; while the latter 
is concerned with assessingone's pattern or struc- 
ture for influencing others. 

No significant difference was obtained between 
the Apache Indian youth and the Anglo American 
youth on the scores for the LQT, but a significant 
difference was obtained on the scores for the LAE. 
The difference in terms ofthe LAE is characterized 
by 1) Apache youth being more laissez faire, more 
autocratic aggressive, and more autocratic submis- 
sive, while 2) the Anglo youth are more democratic 
or cooperative. 

No sex differences were obtained on either test 
for Apache youth. The older and more intelligent 
Apache youth did better on both leaders hip tests; 
while the better achievers in school did better only 
onthe LAE. Scores from both tests were related 
at the five percent level, indicating that Apache 
youth who had better values inleadership had better 
patterns for influencing others. 
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TABLE III 


COMPARING THE LQT AND LAE TOTAL SCORES WITH OTHER DATA (N - 148) 


————————Є—Є&——Є—Є—Є—Є—Є——Є—————————————— 


The Leadership The Leadership 
Other Data Q-sort Test Ability Evaluation 
Total Score Total Score 
ттт тс ЕЕ АА 90 9B ит мекен салер ем лала a A a AR ASN 
Age (to nearest 6 months) **. 205 жж, 534 
1.0. (from Otis group test) *.172 жж, 219 
SAT (total score from battery) . 062 жж, 209 


The Leadership Ability Test (total score) *.166 


Ж Anr of . 159 and better is statistically significant at 5% level. 
** Ап г of .208 and better is statistically significant at 1% level. 
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tember, 1950, Vol. 15, No. 3. therapy Theory and Research, edited by Mow- 
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RELATIONSHIPS BETWEEN ASPECTS OF 
PREPARATION AND MEASURES OF 
PERFORMANCE OF SECONDARY 
TEACHERS OF THE SOCIAL STUDIES 


JOSEPH R. ELLIS 
Indiana State College 
Terre Haute, Indiana 


VARYING POINTS of view exist regarding 
practically every aspect of education in the United 
States; however, there appears tobegeneralagree- 
ment that the position of the teacher is paramount in 
the learning situation. With the assumption that this 
is a valid concensus, the determination of the rela- 
tionship of the preparation and performance of teach- 
ers of the social studies becomes a real challenge. 


Problem 


The purpose of the study was to discover 
relationships between aspects of the preparation of 
secondary teachers of the social studies and meas- 
ures of their teaching performance. Four common- 
ly held hypotheses relating to the performance and 
to the preparation of teachers were investigated to 
determine differences between outstanding teachers 
and average.and below average teachers regarding 
their pattern, quantity, and quality of preparation. 


Procedure 


The literature (1, 3) includes reports of a 
considerable amount of research in whichtwo ex- 
tremes of teaching performance have been compared 
to determine the degree of presence of various 
characteristics and factors which might be predic- 
tive of teaching success. Тһе Study reported here 
sought to compare teachers who were designated as 
being outstanding withthose who were designated as 
being less than outstanding; thelatter group, there- 
fore, reflected a much broader range of teaching 
ability. 


Seventy secondary (grades 7-12) teachers 
of the social studies were selected on the basis of 
common criteria from six urban Texas school dis- 
tricts which were consideredto exemplify sound ed- 
ucational practices. Data concerning, preparation 
were obtained from the teacher's official transcript 
on file in the superintendent's office. Оп the basis 
of procedures described below, teachers were des- 
ignated as outstanding teachers, Group A, and as 
average and below average teachers, Group B;thus, 
two groups were estanlished and held constant a nd 
used for comparison purposes throughout the study. 
If the reader is willing to assume that the groups 
compared were randomly drawn, whieh strictly 
Speaking they are not, the t-test for the significance 
of the difference between the means of two groups 
can be used. Where appropriate, the chi-square 
test of significance was also employed in treating 
the data. The study was completed in the fall of 
1959. 


Findings 


Teachers were grouped on the basis of their 
building principal's generaldesignation. Тһе 4е- 
pendability of the principal'S generalized rating of 
the teacher was examined by an investigation of Hy- 
pothesis I of the study: 


Teachers of the social studies who are rated by 
their principals as being outstanding are demon- 
strably better teachers than social studies teach- 
ers whom these same principals rated as being 
average and below average teachers. 


Forty-four teachers were rated as being 
outstanding and formed Group A, while twenty-six 
teachers wereratedas being average and below av- 
erage and formed Group B. Responses to fourin- 
Struments were secured in the investigation of the 
hypothesis, Measures included a scale adapted from 
Kinney (2) for the building principal's rating of the 
teacher's professional competence a nd the princi- 
pal's ratings on a critical incidence check list of 
teacher behavior characteristics adapted from 
Ryans (4). Twenty students of each teacher also used 
the behavior characteristicscheck list to rate their 
teacher. Teachers themselves responded toa check 
list of professional problems encountered in their 
teaching and to the Minnesota Teacher Attitude In- 
ventory. 

Differences between the means of the two 
groups for both of the principal's detailed quantified 
ratings favored Group A and were statistically sig- 
nificant at the . 01 level of confidence. Student rat- 
ings and the teachers' responses onthe professional 
problems check list and on the M. T. A.I. also fa- 
vored Group A but lacked statistical significance. 
The data supported Hypothesis I, and it was con- 
cluded that the two groups were infactdifferent with 
respect to teaching performance and that this differ- 
ence assumed added importance whenone considers 
that the two groups were not selectedto be repre- 
sentative of extremes in social studies teacher per- 
formance. 

Having established that the two groups were 
different in teaching performance, an investigation 
was made into four aspects of Hypothesis II of the 
study: 


Teachers of the social studies ratedas outstand- 
ing by their principals have followed different 
patterns of preparation than have teachers of the 
social studies rated as average and below aver- 
age by their principals. 


It was found that: 


(a) as undergraduates a large r percentof the 
Group А teachers had declared majors inthe 
Social sciences than did the Group B teach- 
ers; however, the difference between the 
means of the two groups was not statistically 
Significant, 


the two groups differed only slightly with re- 
spect to the amount of preparation inthe so- 
cial science major, 


(b) 


the percent of GroupA teachers whocom- 
pleted student teaching was much greater 
than that of Group B teachers, but the dif- 
ference was again found to be lacking insta- 
tistical significance, 


(c) 


(d) as undergraduates, a larger percentof the 
Group A teachers were enrolled in other than 
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education departments or colleges than were 
the Group B teachers; however, here too, 
the difference was found to lack statistical 
Significance. 


Although a pattern of differences was noted, 
the data did not support the hypothesis in that sta- 
tistical significance for the differences between the 
means of the two groups for the four aspects of pro- 
gram pattern investigated was not found. 

The relationship between the quantity of prep- 
aration as expressed in terms of college semesters 
hours of credit and teaching performance was con- 
Sidered by an investigation of Hypothesis III of the 
study: 


Teachers of the social studies who arerated as 
outstanding have completed a greater quantity 
of preparation in their overallprogram,in gen- 
eral education, in the social sciences, and in 
professional education than have teachers of the 
Social studies whoare ratedas average and be- 
low average. 


In Table I a summary of the findings for Hypothesis 
Ш is presented. It was found that: 


(a) the two groups were very much alike with 
respect to the overall levelof preparation at- 
tained, 


in the area of undergraduate general educa- 
tion, the Group A teachers completed a se- 
mester hour mean which was greater than 
that of the Group B teachers; however, the 
difference was not statistically significant, 


(b) 


(c) in the area of Socialscience preparation, the 
mean amount of overall preparation (under- 
graduate and graduate) and the mean amount 
of work in history, government, soc iology, 
and economics was greater for the teachers 
in Group A than for the teachers in Group В, 
although, for none of these comparisons was 
the differences between the means of the two 
groups found to be statistically significant, 
(d) in the area of professional education, the 
mean amount of overall work (undergraduate 
and graduate), the mean amount of prepara- 
tion in student teaching, and the semester 
hour mean for student teaching completed in 
the social studies was greater for the Group 
А teachers than for the Group B teachers; 
for only one of the comparisons between the 
mean differences was statistical significance 
found, that being for the amount of semester 
hours completed and for the number of teach- 
ers completing student teaching in the social 
studies. 


For only one of twelve specific comparisons made 


6 THE JOURNAL OF EDUCATIONAL RESEARCH 


TABLEI 


А SUMMARY OF COMPARISONS OF ASPECTS OF THE QUANTITY (SEMESTER HOURS) 
OF PREPARATION FOR GROUP A AND GROUP B TEACHERS 


M —M — M ——————————»—— 


Area of Semester Amount Favored 
Preparation 


General 


Hour Ву Difference 
Mean Between Means 


Group A (N - 44) 


t-ratio 


P 


Semester Amount Favored 
Hour By Difference 
Mean Between Means 


Education 58.29 3.99 1. 45 >, 05 54.30 
Social Sciences 
Overall 44,97 7.51 1.16 >.05 37.46 
History 19.56 1.37 .39 >.05 18.19 
Government 11.25 .87 .35 >.05 10.38 
Economics 3.75 .49 .48 >.05 3.26 
Sociology 6. 88 3.50 1. 93 2.05 3.38 
Professional 
Education 
Overall 36.29 3.58 1.10 7.05 32.71 
Graduate 13.56 1.45 .52 2.05 12.11 
Undergraduate 22.72 2.07 1.70 >.05 20.65 
Student 
Teaching 3.68 .92 1.54 >.05 2.76 
Student Теасһ- 
ing іп Social 
Studies 2.23 1.54 3.39 «.01 .69 


Group B (N = 26) 


S 


| 
| 
| 
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TABLE II 


A SUMMARY OF COMPARISONS OF ASPECTS OF THE UNDERGRADUATE QUALITY 
(GRADE-POINT AVERAGE: A = 4.0, В = 3.0, С = 2.0, D = 1.0) 
OF PREPARATION FOR GROUP А AND GROUP B TEACHERS 


{< 
Group А (М = 44) Group B (N = 26) 


Mean Mean 
Area of Grade Amount Favored Grade Amount Favored 
Preparation Point Ву Difference t-ratio P Point By Difference t-ratio P 


Average Between Means Average Between Means 


Overall 
Undergraduate 2.65 .05 .457 >.05 2.60 ---- --- --- 


General 
Education 2.51 .19 .014 2.05 2.32 ---- Siete nr 


Social Sciences 
Overall 
Undergraduate 2.68 . 03 .19 >.05 2.65 ---- теле 7179017 


History 2.73 --- fom E 2.77 .04 .218 >.05 
Government 2.65 «1. . 850 >. 05 2,48 ---- ates e 


Professional 
Education 
Overall 
Undergraduate 2.90 .16 1.352 >.05 2.74 ---- а o ыы 


Exclusive of 
Student 
Teaching 2.85 719 1.320 >. 05 2.66 vU А "Тш miae 


Student 
Teaching 3.36 .18 - 776 >. 05 3.18 EV SUME eU 
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between the mean differences inGroup А and Group 
B concerning the aspects of the quantity of prepara- 
tion investigated was statistical significance found; 
however, the patternofdifferences favored Group А 
for each comparison which must be considered as 
lending support to Hypothesis III. 

The relationship between the quality of 
teacher preparation as expressed in terms of the 
undergraduate college grade point average and teach- 
ing performance was considered by an investigation 
of Hypothesis IV of the study: 


Teachers of the social studies who are rated as 
outstanding have earned a higher quality of un- 
dergraduate preparation in their overall pro- 

- gram, in general education, in the social sci- 
ences, and in professional education than have 
teachers of the social studies who are rated as 
being average and below average. 


In Table II a summary of the findings for Hypothesis 
IV is presented. It was found that: 


(a) the Group A teachers earned a mean grade 
point average for overall undergraduate 
preparation which was slightly higher than 
the mean earned by the Group B teachers; 
this difference was not statistically signifi- 
cant, 


(b) the teachers comprising Group A achieved a 
higher quality of preparationingeneraledu- 
cation than did the teachers in GroupB, but, 
here too, the difference between the means 
of the two groups lacked statistical signifi- 
cance, 


(c) theteachers in Group A earnedahigher mean 
grade pointaverage for overall undergraduate 
study in the social sciences and for prepara- 
tion in government than did the teachers in 
Group B; a converse relationship was found 
to exist for work inhistory, however, for none 
of these comparisons was the difference be- 
tween the means of the two groups statisti- 
cally significant, 


(d) in the area of undergraduate preparation in 
professional education, the Group A teachers 
earned a mean grade point average which ex- 
ceeded the mean of the Group B teachers with 
respect to overall work in education, prepa- 
ration in education exclusive of student teach- 
ing, and for student teaching; again, none of 
the differences between means were found to 
be statistically significant. 


The data failed to substantiate the hypothesis, butas 
was found in the investigation of Hypotheses II апаш 


of the study, a pattern of differences was observed 
which strongly favored the outstanding teachers, 
Group А. 


Summary and Conclusions 


The study sought to determine relationships 
between aspects of the preparation of secondary 
teachers of the social studies and measures oftheir 
teaching performance. А group rated as being out- 
standing teachers was compared with a group rated 
as being average and below average teachers. T he 
findings of the study were generally inconclusive, 
When considered independently, the differences be- 
tween the means of the two groups being compared 
with reference to each of the components of teacher 
education investigated were found to lack statistical 
significance in all but one instance; however, the 
Group А teachers were favored by nineteen of the 
twenty mean differences; thus, the general pattern 
of the differences between the means of the two 
groups overwhelmingly favored Group A, the out- 
Standing.teachers; moreover, this pattern assumes 
added importance when we again note that the two 
groups were not selected to be representative of ex- 
tremes in teaching performance. 

The data obtained for Hypothesis I indicated 
that the groups were in fact different in teaching per- 
formance. In terms of the total results secured for 
Hypotheses П, Ш, and IV, it would seem that pat- 
terns of variables involved іп teacher preparation 
may be more closely related tothe professionalper- 
formance of the teacher of the social studies thanis 
any individual variable. There would appear to be 
Strong support for the hypothesis that none of the 
factors herein considered are determinitive in pre- 
dicting outstanding teachers of the socialstudies and 
that some of the **Q"' factors other than those tested 
in the study are predictors of teaching success inthe 
Social studies. 
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OF CITIZENS ADVISORY COMMITTEES* 
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CITIZENS ADVISORY committees for the public 
Schools, though not new, have experienced a phe- 
nominal growth during the past decade. 

In the first nationwide survey, which was con- 
ducted in 1949, Hull (3) found only 62 citizens com- 
mittees (one of which dated as far back as1919). 
This study revealed, however, a very dramatic in- 
crease in the number of committees immediately 
following World War II as indicated by the factthat 
70 percent of the committees found by Hull were or- 
ganized during the years 1946-1948. By 1956, only 
Seven years later, the National Citizens Council for 
Better Schools estimated that there were app roxi- 
mately 10, 000 citizens committees in active opera- 
tion. 

Although substantial progress has been made in 
clarifying the functions of citizens committees, 
many administrators, board members, and citizens 
are still puzzled about the proper role of a citizens 
advisory committee. Questions frequently asked 
are: 

1. Should the committee be sponsored by the 
board or be completely independent? 

2. Should it be a short-range, specific purpose 
committee or а continuing committee? 

3. Is it more appropriate for a committee to 
work on relatively objective matters, suchas build- 
ing programs or to study curriculum problems that 
involve more intangibles ? 

4. Does a committee actually improve school- 
community relations ? 

5. Is a committee worth all the time it takes? 


Procedure 


This study was confined to those 678 school dis- 
tricts in New York State which maintained twelve 
grades in their public school program and was con- 
cerned only with school-district-wide lay citizens 
advisory committees that were established ona 
purely extra-legal basis. This investigation did not 
include any agriculture, vocational, or adult educa- 
tion advisory committees since these are required 


by law in New York State. 

The study was divided into two major parts: (1) 
four intensive case studies of citizens committees 
in central New York State, and (2) astatewide ques- 
tionnaire survey. 

Data gathering by use of the techniques of obser- 
vation and interview were combined to form the ba- 
sis for the case studies of citizens committees at 
work in the four New York communities. Two of 
these communities were small villages, a third was 
a city of approximately 40, 000 and the fourth com- 
munity was a suburban district adjoining a city of 
over 100,000 population. Thetwo small village 
communities had one board-sponsoredcitizens com- 
mittee each while the city district and the suburban 
district had two committees each, one board-spon- 
sored and one independent. 

The statewide questionnaire survey consisted of 
two phases: 

1. A preliminary questionnaire was sent to each 
of the 678 school districts in New York State that 
operated grades kindergarten through twelve. Re- 
plies were received from 546 districts which means 
there was ап 80. 5 percent response; a total of 222 
districts had active citizens committees between 
July 1, 1957 and June 30, 1958. 

2. The second questionnaire was mailed approx- 
imately one year after the firstto allof the 222 dis- 
tricts which had an active committee during the 1957 
-1958 school year; one-half of the second mailing 
went to chief administrators and one-half to citizens 
committee chairmen. There was a fifty-nine per- 
cent response to the second questionnaire. Since 
65 replies came from administrators and 66 from 
committee chairmen, many comparisons were made, 


Findings 


The statements listed below are the major find- 
ings of the study: 

l. Citizens advisory committees are found ex- 
tensively in New York State and their activities are 
significant for school administration. The fact that 


* Unpublished Ph.D. Dissertation, Cornell University, February, 1961 
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there were 222 active citizens groups during one 
given year, July 1, 1957 to June 30, 1958, means 
that one out of every three schools in the state had 
a committee during that year. There were 137 other 
districts that had a committee previously, but not 
during 1957-1958, making a combined total of 359 
which had a citizens advisory group either during 
1957-1958 or previously. Therefore, 66 percent of 
the districts in the state responding to the survey 
have had experience with advisory committees. 

2. Committees are found more frequently in dis- 
tricts with a population of over 10,000. Of the 158 
districts with a population of 10, 000 or more which 
participated in the survey, 87 percent had a citizens 
committee either during the year or previously. By 
contrast, only 47 percent of those communities with 
populations under 5, 000 had had a committee. 

3. The specific-purpose, board-sponsored citi- 
zens committee is the predominant type in the State 
of New York. There were 222 active committees in 
1957-1958, of which 168 were specific-purpose, 
board-sponsored compared to 36 continuing board- 
sponsored and only 18 independent committees. The 
median length of operation for the specific commit - 
tees was approximately eight months while the me- 
dianfor continuing committees was nearly two years. 

4. The great majority of citizens committee ac- 
tivity is centered aroundbuilding programs and bond 
issues. In 1958, a building program was most fre- 
quently listed as the chief objective by all three 
types of citizens committees: 1) specific, 2) contin- 
uing, and 3) independent. In the case of specific 
committees, a building program was indicated as 
the most important purpose in more cases than 
were all other chief objectives combined. 

5. Citizens committees as a group are general- 
ly successful in achieving their objectives. 

a. A great majority of citizens committees, ap- 
proximately 72 percent, were rated as either 
excellent or above average on a five-point 
Scale by both administrators and citizens 
committee chairmen. Table I shows the ad- 
ministrator's rating of the com mitee in his 
district in 1958 in response to the first ques- 
tionnaire compared with the same administra- 
tor's rating of the same com mittee again in 
1959 after a lapse of a year. Table П shows 
a comparison of the ratings given by the ad- 
ministrator in 1958 with the rating giventhe 
same committee one year later by the chair- 
man of the committee. It should be remem- 
bered that when the second questionnaire was 
sent, one-half went to chairmen and one-half 
went to administrators. Since the administra- 
tors and chairmen were located in different 
School districts, it was unlikely that any col- 
lusion would occur. 

b. Chairmen and administrators alike reported 
that school-community relations had either 
been improved or greatly improvedas a result 
of committee activity in 80 percent ofthe 


cases. Eighty-five percent of chairmen and 
administrators indicated they would recom- 
mend that their boards of education continue 
to sponsor citizens groups. 

c. Eight citizens groups recommended centrali- 
zation (consolidation of school districts) 
during 1957-1958 of which seven passed and 
only one failed. 

d. Other tangible evidence of success includes: 1) 
Seventeen committees recommended curric- 
ulum changes which were made. 2) Fifteen 
recommended new policies which were a- 
dopted, and 3) five recommended new salary 
scales which were accepted. 

6. Advisory committees experience some diffi- 
culties in their operation. The problems most fre- 
quently mentioned as the chief difficulty experienced 
by the committee in the order of their importance 
were: 1) Most of the work of the committee was left 
to be done by a few, 2) Membership was not truly 
representative of the entire school district. 3) 
Membership became divided into factions, and 4) 
the committee was not given or it didnot take e- 
nough time to do the job well. 


Recommendations 


Some have looked upon citizens committees as 
the long-sought panacea for a multitude of school 
problems associated with public relations, need for 
increased financial support, policy making, and 
curriculum construction; some have viewed these 
groups with a suspicious and critical ey e and have 
taken the position that they will inevitably usurp the 
authority and the responsibilities of the school board 
or become another ‘‘pressure group’’ seeking sup- 
port for their narrowly conceived projects; still 
others see in the citizens committee idea an oppor- 
tunity to practice ‘‘psuedo-democracy”’ in which the 
leader is able to use subtle and devious means of 
obtaining from the group the ‘‘right’” recommenda - 
tions. 

Hamlin (2, p. 1) stated in 1952 that he had some 
fear that ‘‘the wide-spread and haphazard develop- 
ment of citizens groups may lead to mistakes and 
excesses that will discredit these groups and kill a 
healthy movement for the use of school-initiated 
committees that has been developing for a genera- 
tion.’’ It seems clear that the citizens committee 
movement сап be a healthy movement if it can avoid 
the excesses and mistakes of which Hamlin spoke. 

Citizens committees should: 

1. Be more concerned about the spirit rather 
than the form of democracy in committee operation. 

The results of extensive observations of commit- 
tees at work and many interviews with participants 
seem to indicate very definitely that citizens com- 
mittee members are far more concerned about an 
opportunity for free discussion of issues than they 
are about following a particular democratic form. If 
the participants feel certain that their ideas will be 


O'NEAL 31 


TABLE I 


CORRELATION OF ADMINISTRATOR RATING OF COMMITTEE SUCCESS ON 
FIRST QUESTIONNAIRE WITH AL MINISTRATOR RATING OF 
COMMITTEE SUCCESS ON SECOND QUESTIONNAIRE* 


кш————————Є——Є—ЄЄЄ——Є—Є{[{Є{]у]у{ЄүуЄүЄ—ЄүуЄ—Є]{]Є]Є&Єү—Є&{уЄ—Є&Єү&Є&ү—Є&Є&Єү—Єүй&Є&Є&ф&ф©фү— 


First Questionnaire Second Questionnaire Rating 
Rating Unsuccessful Poor Average Above Average Excellent Totals 


——————— ÁHÁÁ——Ó———————————————————— 


Excellent 0 0 2 5 15 22 
Above Average 1 0 4 8 Кі 20 
Average 0 0 5 3 0 8 
Poor 0 1 0 1 0 2 
Unsuccessful 4 1 2 0 0 7 


No Answer, 6 


Totals 5 2 13 17 22 59 


Coefficient of Correlation .74 


ж The way to interpret Table I can be illustrated by the number “2” which is found under 
the word ‘‘average’’ at the top of the third column of figures. This indicates that 
two administrators rated the committees in their districts as “ауегаре” on the sec- 
ond questionnaire, when they rated the same committees as ‘‘excellent’’ one year 
earlier on the first questionnaire. 


TABLE П 
CORRELATION OF ADMINISTRATOR RATING OF SUCCESS 
WITH CHAIRMAN RATING OF SUCCESS 
M ——————— 
Chairman Rating 


Administrator Rating Unsuccessful Poor Average Above Average Excellent Totals 


Excellent 0 0 3 5 10 18 
Above Average 1 1 " 7 8 24 
Average 0 0 0 4 8 12 
Poor 1 0 1 0 1 3 
Unsuccessful 0 0 1 0 0 1 


No Answer, 8 


Totals 2 1 12 16 27 58 


Coefficient of Correlation .23** 


** An inspection of Table II shows that there is more general agreement between the two ratings 
than is indicated.by the correlation figure. Statistically speaking, there were not enough 
ratings that fell in the lower end of the scaleto bring out the*'straight line” relationships. 
It is interesting to note that 72 percent of the administrators ratedthe committees in their 
districts as excellent or above average and 73 percent ofthechairmen rated their commit- 
tees as either excellent or above average. 
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treated with dignity and respect, they are not likely 
to be disturbed if the chairman of the committee is 
appointed rather than elected. 

2. Be appointed only after it is certain that ad- 
vice is wanted. 

If decisions have already been m ade or if there 
is a likelihood that the recommendations made by 
such a group would be undesirable or unacceptable, 
a citizens committee should not be appointed. Com- 
mittee members resent being “manipulated” or 
having their recommendations ignored. 

3. Operate within theframework of policies 
concerning advisory committees established by the 
boards of education. 

It seems quite unfortunate that only ten percent 
of the boards of education in the districts partici- 
pating in the statewide survey had established any 
written policies relating to citizens groups. 

4. Be given a suggested time limit, of reason- 
able length, during which they are to finish their 
work. 

The statewide survey showed that 59 percent of 
the citizens committees were not given a suggested 
time limit and of this group, 58 percent received a 
success rating of excellent orabove average. How- 
ever, in those cases where a target date was set, 
the portion receiving a success rating of excellent 
or above average was 81 percent, which was an in- 
crease of 23 percentage points over those commit- 
tees without a suggested time limit. 

5. Be composed of members who serve as indi- 
viduals rather than as representatives of community 
organizations. 

The boards appointed the members of the com- 
mittees which they sponsored in 80 percent of the 
cases; when members were appointed by the board, 
they served as individuals. Atthe presenttime, the 
literature is practically unanimous in recommend- 
ing that members of advisory com mittees partici- 
pate as individuals. 

6. Bea good cross-section of the community 
and should include some **opposition'' in its mem- 
bership. 

In two communities, А and D, where intensive 
case studies were made, the comm ittees failed to 
accomplish their objectives, and the leaders in both 
communities were convinced that the failure to in- 
clude some opposition on the committees contribut- 
ed importantly to their failure. 

". Have sub-committee reports made exclusive- 
ly to the executive committee or to the citizens 
committee as a whole, but nottothe general public. 
The citizens committee should not make public any 
reports or recommendations until after the board 
has had ample opportunity to study them. 

Bretsch (1, p. 76), McCloskey (4, p. 383), and 
Ross (5, p. 504) support this recommendation. 
Community C had an independent committee that vi- 
olated this rule and Community D had a board-spon- 
sored group that also ignored this principle; this 


procedure brought serious difficulties in both cases. 

8. Attempt to achieve wide agreement among 
committee members before making final recommen- 
dations. Minority reports are to be avoided if at 
all possible. 

The questionnaire survey res ults indicated that 
70 percent of the committees were able to obtain be- 
tween 85 and 100 percent agr e e ment with the final 
recommendations and thatthere was a definite rela- 
tionship between committee success and the number 
of committee members agreeing with the final re- 
port. 

Of all the committees responding to the question- 
naire survey, 93 percent had wisely refrained from 
presenting a minority report. 

9. Be board-sponsored rather than independent. 

Of the 222 citizens committees in New York State 
which were active during1957-1958, only eightper- 
cent were operated independently of the board of ed- 
ucation. In the two-year period of 1957 and 1958, 
forty-three specific-purpose, board-sponsored com- 
mittees were formed in districts which had never 
had a committee before, but duringthe same period 
only two independent committees were formed. Thus 
practice leans very heavily in the direction of board- 
sponsored committees. 

In no way is it being suggested that boards of ed- 
ucation should set up committees which will be 
their “rubber stamps.’’ If the committee is not to 
be given real problems and real responsibilities, it 
should not be organized in the first place, Further, 
it is recognized that in some emergencies such as 
when the board of education is unable or unwilling 
to perform its duty, an independent citizens com- 
mittee is in order; however, when the problem has 
been solved, the committee should disband and let 
the board of education, the legally responsible, and 
in most cases the democratically elected group, 
manage the regular school affairs of the district. 
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PRINCIPAL BIAS IN RATING TEACHERS 


ROBERT LORD and DAVID COLE* 
Occidental College 
Los Angeles, California 


Introduction 


IN A PREVIOUS paper Cole (1)has reported 
the findings of a five year study conducted to deter- 
mine if material obtained from certain measures of 
personality was useful in predicting success in ele- 
mentary and secondary teaching. The measure of 
success dealt with inthat paper was the rating given 
each teacher by a classroom observer. It was noted 
in that paper that a rating was also given each 
teacher by the principal, but that such ratings were 
So severely skewed to the favorable end as to make 
the data unsuitable for analysis. The purpose ofthe 
present paper is to report on the results of an at- 
tempt to evaluate the validity of this skewing, and 
then to evaluate the relationship between principal 
ratings and the prediction of teaching success which 
had been made from the personality test data. 

At the time of the collection ofthe original 
data, each principal was interviewed and asked to 
place the teacher under discussion into one of five 
quintiles, ranging from the top to the bottom 20 per 
cent of other teachers of comparable experience, 
known to the principal. Uponthe accumulation of 
the principal responses, it was apparentthat the 
quintile placements were not only generally skewed 
toward the high end, but that this was particularly 
true in certain school districts which played an im- 
portant role in the study. 

Teachers inover a dozenschooldistricts were 
evaluated during the study. These districts varied 
greatly in size and in proximity to the training in- 
Stitution making the study. Analysis of the data 
made it apparent that the skewing of principal rat- 
ings was particularly evident in certain large key 
districts, located close to the training institution 
involved here. This is documentedin Table I, 
District “A” is a large district immediately adja- 
cent to the teacher training institution. District 
“В” is also a large district, fairly close to the 
training school. АП of the other districts partici- 
pating are lumped in the third column. 


А study of the predictions which had been made 
for the teachers in the various districts, or their 
performance in practice teaching, gave no basis for 
believing that superior prospects had entered dis- 
trict A, althoughthere was some support for the be- 
lief that those entering district B were more prom- 
ising than average. There was nobasis, either 
Írom principal interview material, from interviews 
with the teachers themselves, or from knowledge of 
curriculum planning and philosophy to believe that 
teachers entering certaindistricts had been ‘‘tailor- 
made” in terms of training for that particular dis- 
trict. Beyond this, moreover, it was apparentfrom 
a study of the qualitative remarks made by the prin- 
cipals during the interviews that in both districts А 
and B there were often what appeared to be large 
discrepancies between whata principal said and the 
quintile ranking he eventually assigned. In certain 
instances the nature of the criticism were suchthat 
it would have been anticipated thata low quintile rat- 
ing would be forthcoming, only to have it emerge 
otherwise as the interview progressed. 

The same person did all of the interviewing 
throughout the study, and was not associated either 
with any school district, or with the depart- 
ment where the teachers had been trained. The 
principals did understand, however, thatthe person 
represented the training institution in question. 


Method 


From the above it was hypothesized that the 
principals, particularly in districts А and B had of- 
ten given higher quintile placements than a more 
objective appraisal would warrant. Before consid- 
ering the relationship between prediction and prin- 
cipal rating therefore, it seemed wise to attempt to 
gain a further principal appraisal. 

In an effort to get this reappraisal, the coop- 
eration of the central offices of districts A and B 
was obtained, together with the cooperationof anoth- 
er educationalinstitution, notinvolved in teacher 


*The authors wish to acknowledge the Rosenberg Foundation, San Francisco, California, which provided finan- 


cial support for this study. 
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TABLE I 


PRINCIPAL'S RATINGS BY LOCATIONS 


Quintile District A District B Others 


% % % 

5 (top) 61 46 28 

4 19 34 31 

3 13 20 26 

2 4 0 8 

1 4 0 7 
TABLE П 


TEACHER RATINGS BY THREE METHODS 


Rating by 
Teacher Predicted Rating by Forced 
Rating Interview Choice 
1 4 4 4 
2 4 5 5 
3 3 3 1 
4 2 3 4 
5 3 4 3 
6 5 5 5 
7 4 5 3 
8 3 5 5 
9 2 5 1 
10 1 3 3 
11 3 3 2 
12 2 4 3 
13 5 5 4 
14 5 5 5 
15 3 4 2 
16 2 3 1 
17 1 5 4 
18 4 4 4 
19 3 3 3 


TABLE Ш 


RATINGS BY BOTH METHODS FOR TEACH- 
ERS IN EACH PREDICTED QUINTILE 


Mean Rating Mean Rating, 
Predicted N Interview Forced Choice 


Quintile Method Method 
5(tp) 3 5.00 4.67 
4 4 4.50 4. 00 
3 6 3.67 2.67 
2 4 3.75 2.25 
1 2 4.00 3.50 
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training. Тһе files of the central offices were 
studied and twenty nine teachers selected, who had 
previously been rated in principal interviews. In 
each case four other teachers at the same school, 
of nearly equivalent training and experience were 
identified. Care was taken to see that these teach- 
ers had received their teacher training at a variety 
of other institutions, and none at the same training 
institution as that of the teachers in the mainstudy. 

Each principal then received the names of the 
five teachers selected, with the request that he rank 
them on the basis of several criteria, suchas grasp 
of subject matter, classroom control, and relations 
with students, parents, other teachers and admin- 
istration. А final over-all ranking was requested. 
As noted previously, this letter came from an in- 
stitution not directly involved in teacher training, 
and was presentedas a means of acquiring informa- 
tion regarding the importance of various attributes 
of a prospective teacher. The approachand criteria 
were similar to that used in the inverviews. 


Results 


Nineteen of the 29 teachers were ranked in 
this manner by their principals. It willbe seenthat 
this task required a forced choice quintile assign- 
ment, rather than the open quintile assignment 
called for in the interview. The results for each of 
the 19 subjects are presented in Table II. The first 
13 listed were in district А and the other six 
in district B. 

It will be noted that eight of the teachers 
received the same quintile placement in both 
instances, while ten were ranked lower and one 
higher by the forced choice method. This appears 
to support the idea that the ratings assigned by in- 
terview had been spuriously high. Data regarding 
the impact of these changes on accuracy of predic- 
tion is presented in Table III. 

From Table Ш it may be seen that accuracy 
of prediction was improved by the forced choice 
method for all but one of the five quintiles. Since 
so many of the teachers were placed in the top quin- 
tile by the interview method, itwas nearly inevitable 
that accuracy would be greatest there by that meth- 
od. 

In the paper by Cole previously mentioned, ac- 
curacy of prediction was judged in part by the per- 


centage of ‘‘reversals, ” that is, the frequency with 
which a teacher predicted as above average, was 
rated below average, andvisa-versa. Cole reported 
that when success was based on the ratings of a 
classroom observer, predictions from personality 
test data resulted in 11 per cent reversals, sub- 
stantially below the incidence from more conven- 
tional criteria. 

When success was based onprincipal ratings, 
the percentage of reversals for 85 subjects teaching 
in other than districts A and B was 18 per cent. 
For the 19 subjects in districts Aand Bonwhom 
two ratings were available, there were 15 per cent 
reversals when the rating was given in interview, 
and 10 per cent when the rating was of the forced 
choice variety. Thus again, the forced choice 
ratings appear to be more closely related to pre- 
dictions than those given in interview. 

Even the figure of 18 per cent reversals for 
the ‘‘uncorrected’’ sample of 85 teachers is better 
than reported by Cole on the basis of more conven- 
tional, academic material. Thus itappears that 
predictions based on personality material, accord- 
ing to the interpretation method described in the 
previous paper, constituted a promising approach 
to the prediction of teacher success, although to a 
lesser extent when the criteria of success was a 
principal’s rating. 


Summary 


In a study to see if data from personality tests 
was useful in the prediction of success in teaching, 
an obstacle was encountered when ''success"' was 
defined in terms of principal ratings. This wastrue 
because these were skewed to the favorable end of 
the distribution, especially in certain districts. 
Employing a forced choice method for evaluation 
improved prediction. Prediction from the person- 
ality data was more accurate than that from more 
conventional academic data. 
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SOME CORRELATES WITH THE MINNESOTA 


MULTIPHASIC PERSONALITY INVENTOR Y 


WILBUR L. BROTHERS 
United States Armed Forces Institute, Madison, Wisconsin 


IN VIEW OF the continued interest inthe Minne- 
ota Multiphasic Personality Inventory, the corre- 
ations of the various scales of the inventory with 
ertain other variables may be of interest. These 
orrelations are part of a more comprehensive 
tudy which was made at Indiana University and re- 
orted in abstract form in 1950 (1). 

The correlations which follow were based upon a 
tudy of 93 student teachers who were enrolled in 
tudent teaching at Indiana University for the first 
emester 1949-1950, at the secondary level. Stu- 
ent teachers in physical education, health, and re- 
reation were not included in the study. The sam- 
le represented approximately 35 percent ofthe to- 
al student-teacher population atthe secondary lev- 
1 for the semester concerned. 

The sample was made up of 46 men and 47 wo- 
ien. The mean age of the subjects under investi- 
ation was 22.91 years, with a range of 20 years to 
6 years. Forty-one, or 44. 09 percent, ofthe stu- 
ent teachers were veterans. Sixty-nine ofthe stu- 
ent teachers in the study, or 74. 19 percent, were 
rawn from cities, towns, and villages of 1,000 
opulation or over. Twenty-four, or 25.81 percent, 
ame from open rural areas or from small com- 
iunities of less than 1,000 population. The Holle- 
ith technique was used in the codification of the 
ata in order to obtain the sum of the squares and 
е cross products of all the variables used in the 
tudy. The Pearson product-momentformula for 
omputing coefficients of correlation was used. 
he zero order correlations were calculated by use 


of an automatic electronic calculator. The corre- 
lations of the variables in the study with the scales 
of the Minnesota Multiphasic Personality Inventory, 
Revised Edition, are shown in Table I. 

Since high scores on the scales of the Minnesota 
Multiphasic Personality Inventory are purported to 
be indicative of the specified neurotic tendency, the 
predominence of negative correlations in the table 
with more desirable attributes seem to be accounted 
for. For the number of cases used in this study, à 
coefficient of correlation of . 256 is required for 
significance at the five percent level (2). 

The intercorrelations of the neurotic tendency 
Scales of the Minnesota Multiphasic Personality In- 
ventory are shown in Table II. A relative high cor- 
relation between any two scales may indicate a lack 
of discrimination between the two scales. In other 
words, the two scales in question may be measuring 
the same attribute. 

The following correlates are presented as sup- 
plements to studies of the Minnesota Multiphasic 
Personality Inventory. 
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TABLEI 


THE CORRELATIONS OF THE VARIABLES IN THE STUDY WITH THE SCALES ON THE 
MINNESOTA MULTIPHASIC PERSONALITY INVENTORY* 


Interest Scale 


Variable Minnesota Multiphasic Personality Scales 
MMHS MMD MMHY MMPD  MMMF MMPA  MMPT MMS МММА 
eee 
ACHIEVEMENT 
High school rank -. 26 -.20 -.26 -.32 -.12 -.07 -.04 -.21 -.25 
Grade point average at end of two semesters -.19 -.01 -.12 -.15 -.01 .05 04 -.27 -.18 
Grade point average at end of four semesters -.29 -.03 -.21 -.22 -.01 .02 02 -.32 -.21 
Grade point average in education courses -.14 -.01 -.07 -.12 03 -.02 -.10 -.20 -.21 
Grade point average іп major at end of six semesters -.34 -.12 -.17 -.30 -.03 .01 -.08 -.31 -.23 
GENERAL ABILITY -.03 -.19 -.04 -.14 04 -.16 -.18 -.19 -.09 
READING COMPREHENSION 
Vocabulary -.07 .14 -.02 -.14 08 -.13 -.15 20 01 
Reading Speed .06 -19 07 -.05 04 .05 22 19 -.05 t 
Level of Comprehension .04 -.21 02 -.03 05 .01 -.18 -.17 03 тш 
o 
INTERPRETATION OF DATA E 
ti 
General -.30 -.14 -.09 -.10 01 .09 -.16 -.29 -.15 g 
Probability True-False -.16 -.15 0.07 -. 20 07 .07 -.18 -.27 -.01 
Insufficient data -.17 -.06 -.17 07 -.06 .05 -.07 -.07 -.20 
True-False -.24 -.08 -.01 -.14 -.02 .11 -.23 -.26 -.25 
Caution :13 .11 -.01 25 -.11 .01 28 34 -.14 
Beyond data .21 .10 -.02 -.02 07 -.14 06 07 -.14 
Crude errors .18 .11 -.01 14 01 -.06 21 23 26 
GENERAL CULTURE 
Current Social Problems -.04 -.12 -.07 -.05 13 -.11 -.09 -.11 .04 
History and Social Studies .06 -.04 03 -.27 10 -.02 -.02 -.01 -.05 
Literature -.07 -.11 02 -.03 -.01 .03 -.15 -.09 -.02 
Science -.04 .01 01 01 10 -.03 25 -.11 -.10 
Fine Arts -.15 -.18 -.19 -.19 .02 -.03 -.16 -.05 .12 
Mathematics -.16 .01 -.01 -.08 .02 -.04 -.14 -.09 -.16 
* Explanation of Minnesota Multiphasic Personality Scales Code: 
Hypochondriasis Scale MMHS Paranoia Scale MMPA 
Depression Scale MMD Psychasthenia Scale MMPT 
Hysteria Scale MMHY Schizophrenia Scale MMSC 
Psychopathic Deviate Scale MMPD Hypomania Scale MMMA 
MMMF 
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TABLE II 


INTERCORRELATIONS OF NEUROTIC TENDENCY SCALES OF THE MINNESOTA 
MULTIPHASIC PERSONALITY INVENTORY N - (46 MALES - 47 FEMALES) 


H 

gm 

———————————————————————————ÀMáá— Ei 
Scale Code HS D HY PD MF PA PT sc MA 5 

E 

Hypochondriasis MMHS 1.0 .24 .59 .31 .07 .06 .21 .40 .18 B 
о 

Depression MMS 1.0 .25 4.45 .09 .22 .51 .36 -.004 ІЗ 
т 

Hysteria MMHY 1:0 .23 .03 .22 .13 а .18 S 
a 

Psychopathic Deviate MMPD 1.0 -.04 .27 .35 .4T .26 5 
o 

Interest MMMF 1.0 -.01 .01 -.03 -.03 Е 
Paranoia MMPA 1.0 .32 519 -.01 ü 
n 

Psychasthenia MMPT 1.0 263 122 E 
Schizophrenia MMSC 1.0 .35 E 


Hypomania MMMA 1.0 
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WORD DISCRIMINATION DRILLS 


GEOFFREY P. MASON 
Victoria College, British Columbia 


THE ABILITY to aiscriminate small differences 
between words has been shownto be significantly re- 
lated to spelling ability (4, 9). An improvement in 
word discrimination ability has been obtained 
through the use of drills. The increased word dis- 
crimination ability was accompanied by an increase 
in spelling ability (7). 

The type of drill exercise material which should 
prove most effective in developing good discrimina- 
tion of words now warrants consideration. Тһе na- 
ture of the perceptual ability involved in the dis- 
crimination of word forms isstillindispute. Those 
who view word discrimination as a specific factor 
tend to deny the possibility of effective general 
training with non-verbal material (2, 5, 6, 10). 
Those who view word discrimination as a facet of a 
general perceptual skill emphasize the transfer- 
ability of generalized training particularly by the 
use of tachistoscopic techniques (1, 3, 8). The ex- 
perimental findings on perceptual training are con- 
flicting. 

The purpose of this study was not to investigate 
the nature of the perceptual ability involved in word 
discrimination, but to attempt to devise a practical 
tool for the improvement of visual discrimination of 
words in the classroom. Тһе identical elements 
theory of transfer suggests that drill exercises in- 
volving real words would be most effective particu- 
larly if the words were taken from the spelling cur- 
riculum which was later to be tested. On the other 
hand, nonsense material often ap p ears more diffi- 
cult to discriminate because familiar patterns are 
absent and, therefore, may contribute more to the 
development of a visual discrimination skill by com- 
pelling the subject to perceive more carefully. The 
visual discrimination of geometric designs and other 
non-verbal forms has yet to be shown to be closely 
related to spelling achievement. 

Two types of drill material consequently were 
devised. Type A consisted of real words taken from 
the spelling curriculum for Grade IV and words of 
similar difficulty. Type B consisted of nonsense 
words. Thirty drill sheets of each type of material 
were constructed. Each sheet consisted of fifty i- 
tems. The first twenty-five required the identifica- 
tion of a pair of words or letter groups as being the 
same or different (e.g. hand - hand; geus - gues). 


The next twenty items requiredthe selection from a 
row of four alternatives those words which were the 
same as the first word of the row. The last five i- 
tems required the underlining of words in a row 
which contained exactly the sameletters asthe first 
word of the row. The drills contained the common 
perceptual errors of omission, substitution, rever- 
Sal, and misplacement of letters. 


Subjects and Method 


The subjects used were Grade IV students in ele- 
mentary schools in an urban district of British Co- 
lumbia. From the district twelve classes were 
randomly selected and assigned to groups A, B and 
C. 

Spelling Test 1, consisting of forty words taken 
from the curriculum in Grades II, Ш and the first 
month of Grade IV, was administered to the three 
groups. 

The groups then received the following treat- 
ments: 

Group A--A Monday drill exercise which used the 
words in the weekly spelling unit and a Thursday 
drill exercise which used words of approxim ately 
50 percent difficulty for grade four obtainedfrom 
the Buckingham Extension of the Ayres Spelling 
Scale. After twelve weeks of drills, theMonday 
exercise only was continued for another six weeks 
to make а total of thirty drills over the whole period. 

Group B--The same as Group A except that all 
drills consisted of nonsense material. 

Group C--A control group which received no 
drill exercise. 

At the termination of the drill period, all groups 
received Spelling Test 2, consisting of forty words 
taken from the spelling curriculum under study dur - 
ing the drill period. 

Data from both spelling tests were obtained from 
the following number of subjects in each group: А, 
115; В, 133; С, 119. The percentage of cases lost 
through absence or transfer were for Groups A, B, 
and C, respectively, 23, 15, and 19. 

An analysis of covariance design was usedto ad- 
just for the initial differences on Spelling Test 1 a- 
mong the three groups. The difference between ad- 
justed means was evaluated by the critical ratio, t. 
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TABLE I 


SPELLING PERFORMANCE BEFORE AND AFTER 
THE EXPERIMENTAL PERIOD 


Group N Mean of Mean of Adjusted Means S.D. of 
Test 1 Test 2 of Test 2 Test 2 
A 115 30.44 32.96 31.53 6. 68 
B 133 28.14 32.91 33.26 7.12 
с 119 27.98 27.45 28.01 8.88 


Results 


The mean scores of the groups for Spelling Tests 
land 2 and the adjusted means for Test 2 are given 
in Table I. 

The t ratios obtained from testing the signifi- 
cance of the differences between the adjusted means 
for spelling of the three groups and thelevels of 
significance are as follows: 


Structed to serve subjects at different levels of de- 
velopment, irrespective of thespelling curriculum 
under study. 
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Discussion 


The purpose of this study was to examine the ef- 
fectiveness of two types of drill material. The gains 
made by Group B are significantly greater at the 
‚05 level of confidence to those of Group A. How- 
ever, the relative difficulties of raising a mean of 
30. 44 to 32. 96 and 28. 14 to 32. 91 cannot be accur- 
rately assessed from the data. Nevertheless, it 
appears safe to assume that the nonsense material 
is at least as effective, if not superior, to the ma- 
terial employing real words. The subjects іп Group 
А, who used real woras for drill purposes, had 
greater exposure to the words later to be tested than 
did those in Group B, who used nonsense material. 
The effectiveness of the latter type of material may 
have resulted from an apparently greater difficulty 
inherent in the discrimination of nonsense material, 
which assisted the development of a higher level of 
discriminatory ability. 

The demonstrated equivalence of the nonsense 
material is of practical value for the classroom 
teacher. If it had been found that words in the spell- 
ing curriculum made the best material for drill ex- 
ercises, a continual change in drill material would 
be necessary with changing grades, c hanging cur- 
riculum, and different supplementary words. The 
findings of this study suggest that material innon- 
sense form is at least equally satisfactory for the 
purpose of developing visual discrimination of words. 
A series of graded exercises could thus be con- 
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LEARNING DISABILITY IN RELATION 


TO FAMILY CONSTELLATION 


ALLEN HODGES and BRUCE BALOW 
University of Minnesota 


PERSONALITY theorists have long maintained 
that an individual's birth position with regard to 
siblings is a significant factor in personality devel- 
opment (1). Recent studies by Koch (3, 4, 5, 6, 7, 
8) and Schachter (10) add substance to such specu- 
lation. While Schachter's study is primarily con- 
cerned with ordinal position (first born vs. others), 
Koch introduces additional variables: sex of subject, 
sex of sibling, age differences between subject and 
sibling in two-sib families. Тһе present study at- 
tempts to explore certain of these factors with re- 
spect to the area of school learning disabilities. 

The assumption is frequently made that a learn- 
ing disability occurring in a child of normal intel- 
lect is an outgrowth of social and/or emotional mal- 
adjustment (2, 9). If such istrue and if social and 
emotional development appear related to family 
constellation, then the study of learning disability 
in relation to family constellation would appear to 
be a fruitful area of research. 

Based on the Koch and Schachter findings, a ten- 
able hypothesis would be that first-borns tend to be 
in a more psychologically stressful situation and, 
therefore, should demonstrate more learning disa- 
bilities than their younger siblings, regardless of 
sex of sibling. With the known accelerated devel- 
opmental growth of girls as compared to boys, one 
could further hypothesize that a male child with a 
sister would be in a particularly stressful situation 
regardless of whether the sister is older or young- 
er. The rationale from which this hypothesis stems 
is that intra-family comparisons are common and 
the faster development of girls, particularly their 
more ready acquisition of school skills, tends to 
make those comparisons, whether by the child or 
others, unfavorable to the male thus creating stress. 

Formally stated, the hypotheses tested in this 
study are: 

1. In two-sib families, learning disabilities 
tend to occur more frequently in the first-born male 
regardless of sex of sibling. 

2. In two-sib families, a male child with either 
an older or younger sister should show a high inci- 
dence of school learning difficulty. 


Selection of the Sample 


The Psycho-Education Clinic of the University of 
Minnesota, College of Education, has been in exis- 
tence since 1947. Over 1600 childrenhave been re- 
ferred because of crippling learning disabilities. 
Reading disabilities account for about 80% of the 
total case load. The ratio of male referrals to fe- 
male referrals is approximately five to one. 

The majority of the referrals arefrom the urban 
Minneapolis-St. Paul area. Socio-economic status 
of the referred children tends to be above average. 
АП referrals are voluntary on the part of th e par- 
ents. 

From a perusal of the 1600 cases seen atthe 
Clinic, 261 male subjects were selected, the selec- 
tion based on the following criteria: 

a) Both the father and mother arepresent in the 
home. 

b) Each subject has only one sib. 

c) The age difference between subject and sib 
does not exceed plus or minus six years. 

d) The subject has at least normal intelligence 
as evidenced by an IQ of 90 or above on ап individ- 
ually administered intelligence test. 


Findings 


The absence of base rate data as to the propor- 
tion of male and female offspring in two-sibling 
families presents many hazards to research within 
this area. Until base rate data are available, it ap- 
pears parsimonious to assume that two-sib families 
follow the sex distribution of boy -boy, boy-girl, 
girl-boy, girl-girl, corrected for the known birth- 
rate differences of 104 boys per 100 girls. 

By the selection procedure (selectingonly males 
referred to the Clinic because of learning di sabili- 
ties) twice as many female siblings as male siblings 
of subjects would be expected if the assumption of 
normal sex distribution in two-sibling families is 
used. 

Sex and ordinal position of the siblings of the 261 
subjects distribute as follows: (observed frequencies 


42 THE JOURNAL OF EDUCATIONAL RESEARCH 


are followed by expectedfrequencies in parentheses. 
The expected frequencies are based on normal sex 
distribution corrected for birthrate differences.) 
Male siblings totaled 108 (91. 63), with 56 (45.81 of 
them older than the subject and 52 (45.81) younger; 
female siblings totaled 153 (169. 34), with 82 (84. 67) 
older and 71 (84. 67) younger than the subject. 

By means of Chi Square, no significant differ- 
ence was found to exist between the ordinal position 
of the sibling and the subject experiencing learning 
difficulties. The hypothesis that school learning 
disabilities tend to occur more frequently in the 
first-born male of a two-sib family mustbe reject- 
ed. 

As to the second hypothesis that a male with a 
sister should show a significantly higher incidence 
of school learning difficulties, a Chi Square of 
4. 5013 was obtained when male sibs vs. female sibs 
were compared. This difference is significant at 
less than the 5% level; however, the difference is in 
opposition to the hypothesis stated. Boys with 
learning difficulties tend more often to have bro- 
thers than they do sisters. 


Discussion 


The results of this study must be interpreted 
with caution. First, there is an absence of base 
rates as to sex distribution in two-sib families. 
Secondly, while the sample selection procedure al- 
lows a ‘‘clean’’ test of the hypothesis, it restricts 
the generalizability of thefindings. However, with- 
in these limitations, the finding that schoollearning 
disabilities do not occur more f requently in first- 
born males lends no support to the hypothesis that 
first-borns are in a particularly stressful situation. 
The statistically significant finding that males ex- 
periencing learning difficulties tend more often to 
have brothers than sisters reiterates the already 
known fact that learning disabilities are much more 
likely to occur in boys than in girls. 


Summary 


From 1600 children referredto a psycho-educa- 
tional clinic because of learning disabilities, 261 
male subjects were selected from two-sibling fam- 
ilies. The age difference between subject and sib- 
ling did not exceed plus or minus six years. АП 
subjects were average orabove average in measur- 
ed intelligence. 

An attempt was made to test the hypothesis that 
learning disabilities occur most frequently in first- 
born males. In addition, the hypothesis was tested 


that a male with a sister (older or younger) should 
demonstrate a high incidence of learning disability, 
Both of these hypotheses were rejected. From the 
results of the study, it seems doubtful that ordinal 
position is related to the incidence of school learn- 
ing difficulties. Because of the highly selected na- 
ture of sample includedin this study, this question 
should be tested in normal school situations. 

Having a sister and the effect of this upon the 
disabled male learner in atwo -sib family appears 
to be without significance in the sample studied, 
The significant finding that males with learning dif- 
ficulties tend to have more brothers than sisters is 
interpreted to reiterate the already known fact that 
learning difficulties occur more frequently in two- 
boy families as compared to boy-girl families. 
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Book Reviews 


CHARACTERISTICS OF TEACHERS, THEIR DE- 
SCRIPTION, COMPARISON, AND APPRAIS- 
AL: A Research Study, David С. Ryans. 
Washington, D. C.: American Council on Ed- 
ucation, 1960. xxiii + 416 pp. 


This report is a scholarly document. Ryan 
stresses the point that qualities of a good teacher 
are not absolute but are instead interacting traits 
varying intheir merit and dependent upon education- 
al philosophies, pupil characteristics, course level, 
content and otherfactors. In addition, Wood, intro- 
ducing the report, com ments that those who are 
looking for a quick and easy method for selecting 
youngsters who will make good teachers, or who 
want a simple device for screening teachers at the 
point of employment or promotion, will not find the 
answer to their questions in this report. However, 
certain types of teacher traits have been identified 
that are considered significantly related to teacher 
success in a wide variety of situations. Paper and 
pencil inventories have been developed for experi- 
mental use in appraising these traits. Nevertheless, 
the results are not inaform usable to school admin- 
istrators and others who work with student person- 
nel unless they areresearchspecialists. This need 
is to be met by subsequent (still unplanned) publica- 
tions. 

The attention ofthe study was directed toward 
the identification and estimationof some of the ma- 
jor patterns of teacher characteristics underlying 
teacherbehavior. Two hypotheses were formulated. 
First, that teachers may be described in terms of 
their observable behaviors and self-expressed opin- 
ions, viewpoints and ty pical responses in defined 
situations, and second, that devices utilizing corre- 
lates of behavioral dimensions may be de velo ped 
for the prediction of such characteristics. 

This report took six years to complete with ap- 
proximately one hundred separate res earch projects, 
six thousand teachers, seventeen hundred schools, 
and four hundred fifty school systems participating. 
Extensive classroom observation of teachers by 
trained observers was carried out for the purpose 
of discovering significant patterns of teacher behav- 
ior. 

The study was carried forward in four major 
phases. First, instruments were developed for use 


in classrooms refining observational methods to in- 
crease the reliability ofthe observation. The criti- 
cal incident technique was used by the observers. 
(This involves the judgment of the observers in both 
the choice of good and poor teachers as well as the 
choice ofincidents. Second, similar major patterns 
of teacher behavior in the classroom were deter- 
mined. Afactorialanalysisofthe correlations helped 
determine the major patterns of observed teacher 
behavior (at both elementary and secondary level). 
In the third phase, a paper and pencil instrument was 
developed consisting of items hypothetically related 
to teacher-classroom behavior and other person- 
al characteristics of teachers, including empirical 
derivationof scoring keys for such instruments, This 
instrument became the classroom observation rec- 
ord. Finally, this record, after the observers had 
been trained in its use, was used to evaluate teach- 
ers onaseven-intervalscale. There were four qual- 
ities to describe pupil behavior and eighteen to de- 
Scribe teacher behavior. The terms are inferential 
and bi-polar, such as fair, democratic, responsive, 
and understanding. Each quality defined in behav- 
ioral terms thus became a criterion for the person- 
al prerequisites to teacher effectiveness as inferred 
from the observations of teacher behavior. At no 
time іп the study were the criteria related to pupil 
achievement. The author gives several reasons in 
defense of his criteria and why a pupil gain criterion 
should not be used. 

The primary uses conceived for the teacher 
characteristics study were: 1) for school systems 
as an aid inidentifyingteachersat the time of selec- 
tion for employment or perhaps in connection with 
promotion in identifying teachers who have charac- 
teristics similar to those deemed important and de- 
sirable by the particular school system and the cul- 
ture it represents; and 2) for teacher education in- 
stitutions as an aidto better understanding of teach- 
er characteristics and associated conditions to con- 
tinue to im prove procedures for selecting teacher 
candidates and to improve professional courses and 
curriculums. Caution, however, is suggested in the 
application of the results and conclusions reported 
in the summary, in that the findings are inductive 
inferences from empiricaldata and that the general- 
izations are appropriate only when made to popula- 
tions which are essentially similar to the population 
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employed in the study. Necessarily, all conclusions 
are approximate ratherthan exact and are probabil- 
ity estimates rather than state ments of certainty. 
A final caution is that greater confidence can be 
placed in conclusions whenthey are applied to groups 
of teachers than when they are applied in individual 
cases. 

The study appears to have been very careful- 
ly carried out, but each step involves many subjec- 
tive judgments. If it were repeated a number of 
times and a high consistency were found and the re- 
sults correlated with pupil achievement, the ap- 
proach might be very profitable. 

Jacob O. Bach 
Southern Illinois University 


RESEARCH DESIGN AND ANALYSIS, Edited by 
Raymond O. Collier and Stanley M. Elam. 
Second annual Phi Delta Kappa Symposium on 
Educational Research. Bloomington, Indiana: 
Phi Delta Kappa, 1961. vii + 208 pp. 


The main body of the symposium is composed 
of seven papers as follows: Experimeritationin Ed- 
ucation: Particularly Large Scale Cooperative Ex- 
perimentation with Mathematics Curriculum, Paul 
E. Rosenbloom; The Design of Sample Surveys, Les- 
lie Kish; The Sampling Problem in Educational Re- 
search, Francis G. Cornell; The Design and Anal- 
ysis of Experiments with Some Reference to Edu- 
cational Research, Oscar Kempthorne; Psycholog- 
ical Aspects of Research Design, S. B. Sells; and 
Studying Status vs Manipulating Variables, Julian C. 
Stanley. These papers were discussed by Charles 
E. Gates, Associate Professor and Agricultural Ex- 
periment Station Statistician; CyrilJ. Hoyt, Associ- 
ate Professor of Education; Leonid Hurwicz, Pro- 
fessor of Economics; James J. Jenkins, Professor 
of Psychology; Eugene A. Johnson, Associate Pro- 
fessor of Biostatistics; E. Scott Maynes, Assistant 
Professorof Economics; Richard B. McHugh, Asso- 
ciate Professor of Biostatistics; David S. Pulermo, 
Assistant Professor of Child Development; Paul C. 
Rosenbloom, Professor of Mathematics; E. L. 
Lindquist, Professor of Education, State University 
of Iowa, chairman. 

The symposium is worth more than one read- 
ing of any serious student of Research Design and 
Analysis. It contains, besides the many excellent 
discussions of the technicalaspects of the subject, 
much scientific wisdom. The volume contains clar- 
ifying statements onmany things such as: the choice 
and definition of populations; sampling techniques; 
methods of measuring, collecting, and processing 
data; sample design; design and analysis of experi- 
ments; research design and value systems; frames 
of reference; and the concepts of systems and inter- 
related factors. There is a particularly good dis- 
cussion of the importance of situational factors in 
learning and teaching and the importance of defining 


and studying the interrelationships among significant 
aspects of the situation. Teacher and pupil behay- 
ior must be considered in relation to important as- 
pects of the situation. While not new, there isa 
good discussion of the relationship between samples, 
parent populations and target populations. An issue 
arose in the discussion over whether available pop- 
ulations might be treated as random samples of un- 
defined populations representative of the groups 
studied. There was also a number of references to 
the non-additive nature of certain types of data, a 
point frequently overlooked in research design and 
analysis. Throughout the discussion one will find 
much illustrative material including a detailed dis- 
cussion of the Minnesota Mathematics Experiment. 
The volume was dedicated to the late Palmer 0. 
Johnson. 

The participants are tobe congratulated for an 
interesting and informing discussion, and Phi Delta 
Kappa should be congratulated for sponsoring such 


a vital series. 
A. S. Barr 


University of Wisconsin 


HOW TO TELL THE SCHOOL STORY, Leslie W. 
Kindred and Associates. Englewood Cliffs, 
N.J.: Prentice-Hall, 1960. xii+ 500 pp. 


The pressure on the school to compete effec- 
tively with a wide array of claimants for an increas- 
ing slice of the taxpayer’s dollar has brought into 
sharp focus in recent years the imperative impor- 
tance of clear, continuous, candid communications 
between our institutions of primary and secondary 
education and their supporting publics. This pres- 
sure has produced a rash of brochures and texts on 
how to tell the school story. 

This book by that apt name is one of the best. 
Dr. Kindred and his colleagues have crammed into 
500 pages as practical a handbook of public relations 
technique as is available in any form for the knowl- 
edgeable school man. It is no reflection on the fun- 
damental excellence of this book to point out that it 
has a number of inherent flaws which render it 
something less than the definitive work in its field. 

First, the book is essentially the work of à 
committee, and as such it has a certain lack of co- 
herence and a component of duplication. 

Second, it is long on assorted practices and 
shy on basic principles, perhaps in deference to Dr. 
Kindred's companion book, School Public Relations. 
One would particularlylike to see a more extensive 
treatment of coordinated PR planning. 

Third, it draws its inspirations and examples 
largely from the metropolitan school system, to an 
extent that the small-town superintendent may be 
more dismayed than challenged by its array of big- 
time media and methods. 

C. A. Schoenfeld 
University of Wisconsin 
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SOCIAL INTEGRATION IN URBAN COMMUNITIES: 
A Guide for Educational Planning, Robert B. 
Knapp. New York: Bureau of Publications, 
Teachers College, Columbia University, 1960. 
ix + 196 pp. 


If public schools try, they can do something 
constructive about community integration. This 
thought is disturbing in view of the disintegrative 
forces currently at work in urban areas. Either 
many urban school systems are not making the ef- 
fort—or they areandthe resultsare meager. A new 
book inthe Teachers College Studies in Education 
takes the position that community integration re- 
quires more purposeful educational effort than has 
been made. Onthe assumption that certain integra- 
tional factors in communities andthe social functions 
of public education are related, integration in ur- 
ban society as documented by theo retical and em- 
pirical studies is examined. 

Six sociological studies are analyzed and six 
questions for education are formed and answered. 
The studies are Middletown, Black Metropolis, 
Yankee City, Steeltown, Regional City, and Park 
Forest. The subsequent educational int errogation 
is as follows: 


Q.  Whatis the proper position of public schools 
regarding conflicting values and interests? 

A. Insist on being free of subservience to eco- 
nomic or political power groups. 


Q. What is the role of the public schools in has- 
tening the process of racial-ethnic integra- 
tion? 

A. They must hasten the process. 


Q. What part can public schools play in counter- 
acting or minimizing the di visive effectsof 
Social class bias? 

A. Individual merit must be placed abo ve pres- 


tige. 


Q. What part can public schools paly in counter- 
acting or minimizing the divisive effects of 
social class bias? 

A. Individual merit must be placed above prestige. 


Q. What is the school’s function amid conflicts 
resulting from differences among organi zed 
economic groups? 

A. Teach people to consider total community well- 
being. 


Q. What is the proper relationship of the school 


to the power structure? 
Train for broad participation in community 
planning and policy formulation. 


Q. What is the suburban public school's integra- 
tive role in meeting problems of increasing 


mobility, specialization, and dislocation? 
A. More general education, inthe problem- 
centered, value-oriented sense, 


This assignment is not a new one for public schools 
—be free of bondage to selfish interests, ре common 
to all, place merit above prestige, build a concern 
for total community well-being, train for broad par- 
ticipation in public policy formulation, and educate 
within the context of problems and values. However, 
the complexity and disintegrative drift of urban life 
give this assignment an urgency not formerly felt. 

Angell’s measures of integration are cited by 
Knapp as ways of arriving quantitatively at conclu- 
sions regarding progress toward integration. These 
measures are indices based on welfare effort and 
crime rate. It seems to the reviewer that such in- 
dices may very wellbe high-low. i.e., high for wel- 
fare effort and low for crime—in a high conformity 
monolithic society, which doesn’t establish anything 
about integration except possibly that the best way 
to deal with it is through larger units of homogene- 
ity. If the goal is not monolithic, norms must be 
set accordingly and then improved within context. 
The reestablishment of norms certainly doesn’t ne- 
gate the highly desirable ideal of no crime and no 
essential material need. 

The point is made by Knapp that cultural inte- 
gration is not a singular phenomenon, but in relating 
it to democratic unity the latter is described as built 
on respect for individuality and recognition of differ- 
entiated goals. Perhaps, instead, the commonality 
and the consequent movement toward cultural inte- 
gration should rest on the way decisions affecting 
public policy are made rather than on the common 
grasp of a set of abstract beliefs in the intrinsic 
worth of persons апа the right to be different. Prop- 
er decisions properly executed are the works which 
prove the faith far better than incantation. 

There are several tiresome struggles of which 
this book brings no end tosight. One is the struggle 
to so structure education and consequently society 
that value and enduring meaning give integrative sig- 
nificance to knowledge and its uses. This is the pa- 
rochial vie w—education within a value framework. 
The position taken by Knapp regarding the key inte- 
grative role of public schools may be underminded 
by the widely held notion that parochial or “‘s pe c ial 
interest’’ schools can do this job, except in the sense 
of voluntarily cooperating with other agencies in 
planning and in program building. 

Another, not openly confronted by Knapp, is the 
struggle to circumscribe public school responsibil- 
ity to create knowledgeable, culturally sophisticated, 
highly literate persons. Totake little concern for 
human development other than intellectual will not do 
theintegrativejob. Yet if purposeful educational ef- 
fort is required, to which educational institution or 
agency other than public does one turn? This exag- 
gerated concernfor опе aspect of human development 
surely cannot give us the promise of history that it 
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will save even a monolithic culture from destruction 
or decay. How, then, can it offer hope to a polyge- 
netic culture striving to achieve the most delicate 
balance known to man—stability and a workable 
amount of integration while holding many values, 
many gods, and many goals. 

The problem is not to create a system of educa- 
tion having no common commitment as its goal; 
rather it is to create a system having such commit- 
ment which protects the widest latitude of individual 
personal, social, and economic maneuverability. 
Since any and every education must spring from a 


setting and be pursued within a setting it cannot be 
aseptic, i. e., purifiedof the taint of a viewpoint re- 
garding the desirable structure of society. The 
evaluation must then be made on the basis of its con- 
tribution to an open-door rather than closed-door 
Society. The world, it often seems, is filled with 
people who, in the name of education, want to close 
alldoors on answers or solutions to problems ex- 
cept one. 


Howard Wakefield 
University of Wisconsin 


Field News 


Readers are invited to send items of research interest to 
* John Schmid, Professor, College of Education, University 
of Arkansas, Fayetteville, Arkansas. 


PREMEDICAL SCHOOL EDUCATION 
А survey of Harvard College graduates attending American medical schools supports the 


growing belief that concentration on pre-medical courses to the exclusionofliberal arts subjects 
in undergraduate programs is unnecessary and, perhaps, undesirable for future medical students, 
In the first year of medical school, those who majored in pre-medical courses as undergraduates 
ranked above those with broaderbackgrounds. However, bythe third year, the liberal arts grad- 
uates were on equal standing with the pre- medical group and at times surpassed them. (From the 
Journal of Higher Education, Volume XXX, No. 6, June 1959.) 


SOCIAL CLASS STRUCTURE AND COLLEGE ATTENDANCE 

William Sewell, Professor of Sociology, University of Wisconsin, in a state-wide survey of 
Wisconsin high-school students found that sixty-three percent of the students' fathers were in 
professional and executive categories and that forty-one percent of those whose fathers were in 
white-collar jobs planned to attend college. Only twenty-eight percent whose fathers were skilled 
workers, twenty-two percent whose fathers were farmers, and twenty-one percent whose fath- 
ers were unskilled workers planned to do so. Professor Sewell states, “ТЕ our society wishes 
to step up the production of able young scientists, educators, and professional workers, there is 
a great reservoir of relatively untappedtalent to be found among the bright lower socio-economic 


status students. ” 


COMPARISON OF AMERICAN AND EUROPEAN EDUCATION 
In the December 24 issue of the Saturday Evening Post, an eighteen page study is reported, 


«Ів European Education Better than Ours?" Evan Hill and George Gallup report on tests and in- 
terviews of 4142 youngsters in the United States, England, Norway, West Germany, and France. 
Comparison of matched groups of ten and fourteen-year olds showed that Americans rated lower 
on most general information tests, spent less time in class and on homework, and misbehaved 
more in school. However, American children also showed stricter family discipline and more 
interest in higher education and in teaching. 


NEW POST FOR THOMAS N. STERN 
Thomas Noel Stern, former Director of the Division of Research and Statistics, Pennsyl- 


vania Department of Revenue, was inaugurated as President of the Westchester (Pennsylvania) 
State College. 


HARTLEY SIMPSON TO RETIRE 
Hartley Simpson will retire as Dean of the Yale University Graduate School at the end of the 


current academic year. He willrejoin the Yale history faculty after a year's leave for research 
and writing. 
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NEW MATHEMATICAL STATISTICS BOOK. FOR TEACHERS 
Howard Alexander, now at Earlham College, has written a new book, Essentials of Mathe- 
matical Statistics, oriented toward the needs of the teacher of statistical theory. His approach 
is basically intuitive, rather than abstract, and has eliminated gaps in the exposition in order to 
make the text as readable as possible. John Wiley and Sons, Inc., is the publisher. : 


AUDIO-VISUAL AID RESEARCH " v 
The Bureau of Social Science Research, Incorporated, Washington, D. C., has started a new 
study on **Audio-Visual Aids in Public Schools and Factors Influencing their Use. '' This nation- 
- wide survey will investigate the extent to which the public elementary and secondary schools have 
audio-visual equipment available, how it is used, and what trends can be identified or predicted. 
A second objective of the study will be an analysis of the effects of the attitudes of teachers and 


administrators and of the ad m in istrative organization of the school district on the use of these Я 


aids. Тһе project director is Eleanor P. Godfrey. 


RESEARCH PROBLEMS IN MATHEMATICS EDUCATION 
The Cooperative Research Program of the U. S. Department of Health, Education, and Wel- 
fare has recently prepared a monograph entitled Research Problems in Mathematics Education— 
Reports from a Conference of Psychological Problems and Research Methods.’’ Тһе papers in 
this monograph were edited and a manuscript prepared by project director Dr. Philip H. DuBois 
and the principal investigator, Dr. Rosaline L. Feierabend, Washington University. 


EMMETT ALBERT BETTS TAKES NEW POSITION , ) 
The University of Miami announces the appointment of Emmett Albert Betts as Research 
Professor in Education and Lecturer in Psychology. Dr. Betts assumes his new position on 
September 1, 1961. 


NEW YORK STATE EDUCATIONAL RESEARCH CONVOCATION 


An item of interest to many in northeastern United States should be the date of the second | 


New York State Educational Research Convocation. The convocation will be held November 13 
and 14 in Albany, New York. This convocation is sponsored jointly by the New York Education 
Department and the Educational Research Association of New York State. 


PRACTICAL TRAINING MORE IMPORTANT THAN RESEARCH? 

At a recent meeting of the Association of School Business Officials, the Board of Directors 
voted unanimously for a practical how-to-do-it approach for graduate students rather than the 
production and execution of a graduate study research report. Mr. Charles W. Foster, Editor 
of School Business Affairs, the official organ of this Association, intimates, however, that this 
viewpoint may be unfortunate. 


ACHIEVING BALANCE BETWEEN CLASSROOM TEACHERS AND SPECIALISTS 


The findings of a study by Merle B. Landerholm is reported in a recent issue of the Research | 


Bulletin of the Institüte of Administrative Research, Teachers College, Columbia University. , 
Landerholm found that a group of school systems receiving the highest scores on a measure of | 
School system adaptability had an average numerical staffing adequacy of 68 professional staf f 
members per thousand pupil units, that they employed an average of 18 professional specialists 


per thousand pupil units, that class size in their elementary schools averaged 26, and that class | 


size in their secondary schools averaged 22. Ofthese most ad aptable school systems, while 


tending to have the highest number of specialists, kept classes smaller than those systems with 
lower adaptability scores. 
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: It Says in the JER... 


WHAT A STUDENT-TEACHER BELIEVES about himself as a person can have a pro- 
nounced effect on his performance as a teacher, Hatfield believes after studying 
the self-concept theory of personality among a group of teachers’ college under- 
graduates in South Dakota. Students rated as superior student-teachers proved 
to have healthy, confident concepts of themselves. Students rated as inferior 
student-teachers generally were the victims of self-depreciation. Early identifi- 
cation of prospective student-teachers harboring serious doubts about themselves, 
and the provision of adequate counseling services might be expected significantly 
to influence the quality of student-teaching, Hatfield suggests. 


A THREE-LEVEL APPROACH to measurements of concepts associated with verbal 
symbols has potential utility as a means of evaluating students’ vocabularies in 
elementary school arithmetic, Chase reports. He hypothetizesthat this approach 
could be of value in measuring the status of children's concepts associated with 
terms specific to other academic areas as well. A teacher using an appropriate 
test could determine the stage of concept formation at which his students were 
operating and subsequently choose materials and adapt activities in accord with 
the extent to which the concepts of the class progress. 


A FRUITFUL MEASURE OF LEADERSHIP POTENTIAL may be an agreement with 
a normative concept of ‘‘ideal’’ leadership, Arkoff and Shears conclude from 
their study of successful and unsuccessful candidates for a principal-training | 
program in Hawaii. The accepted candidates held remarkably homogeneous, 
**middle-of-the-road'' conceptions of ideal leadership. The rejected candidates 
tended to take more extreme positions, although, on the average, both groups 
had comparable concepts of “consideration” and ‘‘initiating structure." There 
were no differences, incidentally, between males and females. 


IS A PERMANENTLY CERTIFIED ELEMENTARY SCHOOL TEACHER a better 
teacher than one who is only provisionally certified? LuPone answers with a 


qualified “уез” after studying 240 teachers in their first, second, andthird years 
of classroom experience in 120 elementary schools in the state of New York. 
**Permanent"' teachers were rated as more successful than **temporary" teach- 
ers infive categories: preparation, planning, and management; subject matter, 
pupil-teacher relations; evaluation; and instruction. No significant differences 
were revealed between the two groups in the categories of human relations and 
parent-teacher relations. 


THE HESTON PERSONAL ADJUSTMENT INVENTORY appears to appraise charac- 
teristics sufficiently different from those appraised by the Kuder Preference n 
Record, the California Mental Maturity Test, and the Progressive Reading Test 
to warrant its inclusion in a test battery designed for general advisement pur- 
poses. However, the correlations found among the Heston subscales are of suf- 
ficient magnitude so that the user of the Inventory may question whether or not 
several of the scales may be measuring manifestations of a single component. So 
suggest Kingston on the basis of a University of Georgia analysis. 


THE HIGH SCHOOL DAILY SCHEDULE in Midwestern secondary schools presents a 
maze of differing patterns and policies, but Sturges concludes that certain trends 
present advantages, at least in terms of staff and building utilization. Partic- 
ularly he finds that teachers in schools using an eight-period schedule, teach 
more classes and more students per week than teachers in schools with fewer 
periods. Likewise, classrooms are used for classes more heavily in the eight- 
period day. Teachers tend to like flexibility the eight-period day provides, dis- 
like the shorter class periods. 


ORAL READERS SCORED 5О MUCH HIGHER than silent readers in a series of tests 
among freshmen at San Jose City College that Collins thinks oral reading ought 
to get more attention in reading instruction where prose comprehension is the 
objective. He thinks the added motor and aural stimuli afforded the oral reader 
may explain why he does better. Incidentally, the Collins study hints the Flesch 
formula may not measure differences in reading ease with complete confidence. 
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А Comparison of Provisionally Certified and Permanently 


Certified Elementary School Teachers in Selected 


School Districts in New York State 


FOR THE past fifteen years, since World War 
П, the problem of finding competent, well qualified 
teaching personnel has grown increasingly acute. 
There have been many suggestions offered, and 
many plans tried. Perhaps the most widely used of 
these, is the granting of provisional certification 
to people whodo not meet educational requirements 
as established by the State Education Department of 
the State of New York. 

This study represents an effort to determine 
whether the provisionally certified elementary 
School teacher in the first, second and third year 
of classroom experience is as successful as the 
permanently certified elementary schoolteacher in 
the same years. It could serveasagauge to meas- 
ure the value of provisional certification and its 
place in our school systems. 

The teachers studied were from one hundred 
and twenty elementary schools in the State of New 
York. Of these one hundredandtwenty elementary 
Schools, sixty-four elementary schools were from 
residentialareas, thirty-four elementary schools 
were from agricultural areas, twenty-two elemen- 
tary schools were from industrial areas. Two hun- 
dred and forty teachers who were in their first, 
Second and third years of classroom experience 
participated in the study. The two hundred and forty 
teachers were comprised of the following: forty 
provisionally certified elementary school teachers 
and forty permanently certified elementary school 
teachers were in the first year, forty provisionally 
Certified elementary schoolteachers andforty per- 
manently certified elementary school teachers were 
in the second year, and forty provisionally certified 
elementary school teachers and forty permanently 
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certified elementary school teachers were in the 
third year. 

The study used the questionnaire-inquiry tech- 
nique. Sixty statements providing a basis for ratings 
from poor to superior were included in this instru- 
ment grouped under the following seven major areas 
considered important to teacher effectiveness: hu- 
man relations, preparation, planning and manage- 
ment, subject matter, instruction, parent-teacher 
relations, pupil-teacher relations and evaluation. 
The elementary schoolprincipals in participating 
Schools rated one provisionally certified elementary 
School teacher and one permanently certified ele- 
mentary school teacher on a one-to-one basis. 

The data were analyzed with respect to four 
variables, namely: (a) total population of more or 
less than 5000 in villages, towns and cities from 
which the questionnaire was returned, (b) the num- 
ber of years, either more or less thansevenyears, 
of administrative experience of the rater completing 
the questionnaire, (c) the number of years, either 
one, two or three, of teaching experience of the 
teachers being evaluated, and (d) the type of license, 
either provisional or permanent, the teacher held. 
The statistical design of this investigation may be 
described as 2x2x3x2 complex factorial design in 
which correlation for one variable was present. 

Some of the more significant conclusions and 
recommendations derived from an analysis and in- 
terpretation of the data are summarized. 


Area I - Human Relations 


As used in this study this term applies to a 
teacher's ability to work with pupil's, professional 
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TABLE I 
ANALYSIS OF VARIANCE FOR SCORES FOR AREA I HUMAN RELATIONS 


—Є—Є—Є—Є—є—Є{]ү—Є—ЄүЄ{ү]ү—ї]у—ү—ү—үй&—Є———————————————— 


Source of Sum of Mean 

Variation d. f. Squares Square F P Hypothesis 
(v9 ve аа datha sga te ы Ааа lS 1522-20 2 мч. о. лу... 
Between PEY 

Groups 11 537.88 48.90 1.47 2,05 accepted 
Between Judges 

in the Same 1 

Group 108 3,593.30 33.27 

Between Judges 119 4,131.18 

Between L 1 13.07 13. 07 >. 05 accepted 
Interactions 

L x PEY Groups 11 225.83 20.53 1.12 ».05 accepted 
Interaction: 108 1,981. 10 18.34 


Pooled Judges x 
L 


Within Judges 120 2,220. 00 


Total 239 6,351. 18 


The hypothesis tested was that there were no Significant differences between the means of the 
groups. For 11 and 108 d.f., the F-ratio required for significance at the 5 per cent level of 
confidence (or rejection of null hypothesis) was 1.88. For 1 and 108 d. f. » the value of F re- 
quired for rejection was 3. 94. 
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TABLE II 


ANALYSIS OF VARIANCE FOR SCORES FOR AREA II 
PREPARATION, PLANNING AND MANAGEMENT 


TM ——— M ———————————————— 


Source of Sum of Mean 

Variation d. f. Squares Square F P Hypothesis 
aranan on o a aee a mL LEN RS NICO ca ava DN 

Between PEY 11 483. 93 43.99 1.43 2.05 accepted 

Groups 

Between Judges 108 3,325. 80 30.79 

іп the Same 

Group 

Between Judges 119 3,809. 73 

Between L m 317.39 317.39 16.89 <.01 rejected 

Interaction: 11 143.41 13.04 >.05 accepted 

LxPEY 

Interaction: 108 2,029.20 18.79 

Pooled Judges 

xL 

Within Judges 120 2, 490. 00 


Total 239 6,299. 73 


groups. For 11 and 108 d.f., the F-ratio required for significance at the 5 per cent level of 
confidence (or rejection of null hypothesis) was 1,88. For land 108 d.f., the value of Е re- 
quired for rejection was 3. 94. 
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TABLE III 


ANALYSIS OF VARIANCE FOR SCORES FOR AREA III SUBJECT MATTER 


aa 


Source of Sum of Mean 

Variation d. f. Squares Square F P Hypothesis 
more UBER o Ep s SEO TE Gen 22-ы... 

Between PEY 11 613. 70 55.79 - >.05 accepted 

Groups 

Between Judges 108 7,137.20 66. 09 

in the Same 

Group 

Between Judges 119 7,750. 90 

Between L 1 232. 06 232.06 17.75 <.01 rejected 

Interaction: 11 168.24 15.29 2.05 accepted 

L x PEY Groups 

Interaction: 108 3,236.20 29.26 

Pooled Judges 

xL 

Within Judges 120 3,636.50 

Total 239 11,387. 40 


The hypothesis tested was that there were no significant differences between the means of the 
groups. For 11 and 108 d.f., the F-ratio required for significance at the 5 per cent level of 
confidence (or rejection of null hypothesis) was 1.88. For 1 and 108 d. f. , the value of F re- 
quired for rejection was 3. 94. 
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TABLE IV 
ANALYSIS OF VARIANCE FOR SCORES FOR AREA IV INSTRUCTION 


к=ш——————————————Є——Є—Є——————Є—Є—Єк—ЄЄ—ЄЄ—ЄЄ—ЄЄ—Є———Єї]———————Єї&——— 


Source of Sum of Mean 

Variation d. f. Squares Square Е P Hypothesis 
Between PEY 11 1, 292. 53 117.50 1.79 2.05 rejected 
Groups 

Between Judges 108 8, 174, 20 65.69 

in the Same 

Group 

Between Judges 119 9, 466. 73 

Between L 1 608. 02 608.02 30.44 «4101 rejected 
Interaction: 11 191.78 17.43 2.05 accepted 
L x PEY Groups 

Interaction: 108 2,157.20 19.97 

Pooled Judges 

xL 

Within Judges 120 2, 957. 00 

Total 239 12, 423. 73 


The hypothesis tested was that there were no significant differences between the means of the 
groups. For 11 and 108 d.f., the F-ratio required for significance at the 5 per cent level of 
confidence (or rejection of null hypothesis) was 1.88. For 1 and 108 d.f., the value of F re- 
quired for rejection was 3.94. Since the obtained F was 1.79, the hypothesis was rejected. 
It has been customary to investigate differences that are close to the established fiducial 


limit. 
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TABLE V 


TEST OF SIGNIFICANCE OF THE MAIN VARIABLES 
AND INTERACTION FOR THE SCORES FOR AREA IV 


Source of Sum of Mean 

Variation d. f. Squares Squares F P Hypothesis 
P 1 153.59 153.59 2.34 >.05 accepted 
E | 30.82 30.82 2.05 accepted 
Ж 2 874.85 437.43 6.66 <.01 rejected 

Interactions 
PxE 1 10. 43 10. 43 2.05 accepted 
PxY 2 109. 01 54.51 2.05 accepted 
ExY 2 82.88 41. 44 2.05 accepted 

PxExY 2 30. 95 15.48 2.05 accepted 

racine PEY 11 1,292.53 117.50 1.79 7.05 rejected* 

NUM 108 8,174.20 65. 69 

Between Judges 119 9, 466. 73 


Е ри зант Rr eer PD ——— H— 
The hypothesis was that the mean squares for the main variables and interactions were not signifi- 
çant ER QU Square for judges in the same group was used as the error term. 
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TABLE VI 
ANALYSIS OF VARIANCE FOR SCORES FOR AREA V PARENT-TEACHER RELATIONS 


Source of Sum of Mean 

Variation d. f. Squares Squares F P Hypothesis 
Between PEY 11 370.41 33.67 >.05 accepted 
Groups 

Judges in the 108 3, 654. 69 33.84 

Same Group 

Between Judges 119 4, 025. 10 

Between L 1 16.54 16.54 1.69 2.05 accepted 
Interaction: 11 95.51 8.68 2.05 accepted 
L x PEY Groups 

Interaction: 108 1,059.81 9.81 

Pooled Judges 

xL 

Within Judges 120 1,171.86 

Total 239 5,196. 96 


The hypothesis tested was that thers were no significant differences between the neans of the 
groups. For 11 and 108 d.f., the F-ratio required for significance at the 5 per cent level of 
confidence (or rejection of null hypothesis) was 1.88. For 1 and 108 d.f., the value of F re- 
quired for rejection was 3. 94. 
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TABLE VII 
ANALYSIS OF VARIANCE FOR SCORES FOR AREA VI PUPIL-TEACHER RELATIONS 


Source of Sum of Mean 

Variation dif. Squares Squares F P Hypothesis 
Between PEY 11 922. 93 83. 90 1.20 7.05 accepted 
Groups 

Judges in the 108 7,572, 00 10.11 

Same Group 

Between Judges 119 8, 494. 93 

Between L 1 248. 06 248. 06 8.23 <.01 rejected 
Interaction: 11 190. 54 17.32 2.05 accepted 
L x PEY Groups 

Interaction: 108 3,254. 40 30.13 

Pooled Judges 

xL 

Within Judges 120 3,693. 00 

Total 239 12, 187. 93 


тонна а ы алыс hens dq holding oU MV o WHO, 9 Т P NN 
The hypothesis tested was that there were no significant differences between the means of the 
groups. For 11 and 108 d.f., the F-ratio required for significance at the 5 per cent level of 
confidence (or rejection of null hypothesis) was 1.88. For 1 and 108 d.f., the value of F re- 
quired for rejection was 3.94. 
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TABLE VIII 
ANALYSIS OF VARIANCES FOR SCORES FOR AREA VII EVALUATION 


Source of Sum of Mean 

Variation d. f. Squares Squares F P Hypothesis 
Between PEY 11 2, 080. 98 189. 18 1.63 2.05 accepted 
Groups 

Judges in the 

Same Group 108 12, 552. 00 116.22 

Between Judges 119 14, 632. 98 

Between L 1 176.81 176.81 10.28 <.01 rejected 
Interaction: 11 264. 39 24.04 1.40 >.05 accepted 
L x PEY Groups 

Interaction: 108 1,857.80 17.20 

Pooled Judges 

xL 

Within Judges 120 2,299. 00 
LL MM. 
Total 239 16, 931. 98 


The hypothesis tested was that there were no significant differences between the means of the 
groups. For 11 and 108 d.f., the F-ratio required for significance at the 5 per cent level of 
confidence (or rejection of null hypothesis) was 1.88. For 1 and 108 d.f., the value of F re- 


quired for rejection was 3. 94. 
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colleagues, parents and non-teaching personnel. No 
significant differences were revealed between the 
two groups in this area. 


Area П - Preparation, Planning and Management 


This area is concerned with the effectiveness 
of the teacher within the classroom. The emphasis 
has been placed in this study on the use of instruc- 
tional materials and teaching procedures. The a- 
nalysis of data іп this area showed significant dif- 
ferences favoring the permanently certified elemen- 
tary School teacher. 


Area III - Subject Matter 


In this study, subject matter represents those 
areas in the curriculum which are generally taught 
in the elementary schooland which are recommended 
by the State Department of Education of the State of 
New York. Significant differences between the two 
groups were also noted in this area, again showing 
the permanently certified elementary school teacher 
in a more favorable light. 


Area IV - Instruction 


Instruction is considered to be the ability and 
the skill of the classroom teacher in motivating 
learning and in emphasizing study skills. In this 
study, emphasis is placed on the instructional ma- 
terials used and adaptations of teaching methods. 
Significant differences were noted between the two 
groups. However, although there were significant 
differences between the two groups in their entire- 
ty, there were no significant differences as applied 
to the first year of teaching. 


Area V - Parent- Teacher Relations 


Parent-teacher relations involve direct re- 
lationship on the part of the classroom teacher with 
the parent in informing the parent of school routine 
and child's progress, encouraging classroom visi- 
tation, cooperation with Parent- Teachers Associa- 
tion and utilizing parents as resources. In this 
area, nosignificantdifferences were shown between 
the two groups. 


Area VI - Pupil-Teacher Relations 


In this study, pupil-teacher relations refers to 
an understanding of the pupil, including his interest, 
needs, growth and developmentalong witha sympa- 
thetic attitude. In this area, there were significant 
differences between the provisionally certified ele- 
mentary school teacher and the permanently certi- 
fied elementary school teacher. 


Area VII - Evaluation 


In this study, evaluation refers to the ability 


to understand each child including the use of such 
resources as sociometric techniques, pupil cumula- 
tive folders and interpretations of test scores avail- 
able for helping in the adjustment of each child. In 
this area, there were significantdifferences between 
the two groups. The permanently certified elemen- 
tary school teacher showed greater insight in the 
total evaluation of the individual growth of pupils 
using such resources as: psychologist,test results, 
guidance information and pupil's cumulative 
record folders more extensively than the provision- 
ally certified elementary school teacher. 

The evidence seems toindicate that the per- 
manently certified elementary school teacher is more 
successful in teaching effectiveness than the provi- 
sionally certified elementary school teacher in the 
first, second and third years of experience in the 
following areas: Area II, Preparation, Planning and 
Management; Area III, Subject Matter; Area VI, 
Pupil-Teacher Relations and Area VII, Evaluation. 
Also, in Area IV, Instruction, there were signifi- 
cant differences between the second and third year 
teachers as compared with the first year teachers. 

More specifically, the permanently certified 
elementary school teacher received better ratings 
than the provisionally certified elementary school 
teacher in the following areas: (a) the ability to or- 
ganize and plan effectively, (b) the skill to translate 
subject matter into living experience, (с) the pro- 
ficiency in using effectively related materials in 
classroom instruction, (d) anunderstanding and more 
sympathetic attitude toward the child, (e) the ade- 
quate use of such resources as remedial reading 
teacher, speech therapist, art and music special- 
ists, school-nurse teacher and school psychologist 
in the further understanding of the child. 
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The High School Schedule in Midwestern Secondary Schools 


А NUMBER OF articles have been written on 
schedules, most of which illustrate particular pro- 
cedures that may have been successful ina particu- 
lar school or system, and have been neither com- 
pared with other systems nor evaluated by any other 
criteria except that of giving satisfaction within the 
particular school or system under discussion. 

Ingeneral, educators have worked with several 
methods of constructing the daily schedule. They 
have not studied the advantages of different schedule 
provisions in terms of staff and building utilization. 
This leaves the administrator in a position of con- 
structing the daily schedule in one of several ways 
without knowing the comparative advantages of 
Schedules of different types such as eight forty- 
minute periods and six fifty-five-minute periods, 
the best time for scheduling activities, andthe like. 

This study was made to determine current prac- 
tices and comparative advantages with respect to 
different types of schedules inasample of North 
Central Association secondary schools in five states. 

To collect the necessary data, visits were made 
to each office of the NCA Chairman of Iowa, Illinois, 
Minnesota, Missouri, and Wisconsin. Reports were 
examined of 938, or 97 percent of the member 
Schools for the 1958-59 school year. 

Additional information was obtained on the sched- 
uling of classes and activities by visiting 106 se- 
lected schools (11.3 percent of the total number of 
schools in the study) according totheir size and type 
of schedule. Interviews were held with regard to 
their schedule, with the principal and/or superin- 
tendent in each visited school, and forms were left 
with the administrator for distribution tothe class- 
room teachers to permit their rating of the sched- 
ule under which they worked. А total of 836 forms 
from teachers were received (60.7 percent of the 
teachers in visited schools). These forms provid- 
ed information on teacher load and opinions about 
the type of schedule employed in their school. 


Summary of Findings 


Organization of the Daily Schedule. The opening 
time of schools ranged from 7:00 a.m. to 9:30 a.m., 


with the mode at approximately 8:30 a. m. Large 


A. W. STURGES, 
Winona State College, 
Winona, Minnesota 


Schools (enrollment of over 1,000) tended to open 
earlier than small schools (enrollment of under 250). 
The opening time was usually influenced by local 
conditions. Small schools attempted to schedule 
the opening time so that students would not have to 
board school buses too early in the morning; large 
schools were occasionally influenced by the availa- 
bility of public transportation, and were required to 
open the school day so that students could use public 
transportation so as to avoid the rush hour. 

The dismissal time of school ranged f r o m noon 
to 5:00 p. m. Approximately 25 percent of the 
Schools dismissed between 3:30 p.m. and 3:39 p. m. 
There was a tendency for schools of over 1,000 en- 
rollment to dismiss earlier than small schools. 

The length of the school day ranged from five 
hours to ten hours. The modallength of the day 
was from 7:00 to 7:09 hours. Thelength of the 
School day in schools of over 1, 000enrollment 
ranged widely due to the use of the split or double 
shifts. 

The passing time beween classes varied from one 
minute to eight minutes; approximately 36 percent 
of the schools scheduled from 2:01 to 3:00 minutes 
between classes. Larger schools scheduled a long- 
er passing time than small schools. 

Several different patterns were employed in 
Scheduling the lunch period, including split, stag- 
gered, single, closed, and open types. Although the 
range of time was from 19 minutes to 120 minutes, 
the most frequent length was 30 minutes. 

The length of class periods varied withthe num- 
ber of periods employed. For example, the modal 
length of periods for those schools using the six- 
period schedule was 55 minutes, and the length of 
periods varied considerably in those schools using 
the seven-period schedule. The period wasusually 
45 minutes in length in schools with an eight-period 
Schedule, and 40 minutes was the length of periods 
in those schools using the nine-period schedule. 
Schools with more than nine periods per day were 
usually on a split or double shift; the length of their 
class periods was usually 55 minutes. 

The number of periods in the school day varied 
from five periods per dayto eleven periods plus 
activity period per day. The most frequently 0C- 
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curring type of schedule was the six-period sched- 
ue. Approximately 40 percent of the schools in this 
study used this type of schedule, with or without an 
activity period. Approximately 30 percent of the 
schools used the seven-period schedule, with or 
without an activity period, and approximately 22 
percent used the eight-period Schedule withor with- 

' out an activity period. Тһе remainder of the schools 
used either the five-period schedule or more than 
eight periods per day. Approximately 14 percent 
of all schools scheduled an activity period. 

Organization of the Weekly Schedule. Athletics 
and music activities were most often scheduled ev- 
ery day, or 20 times a month, regardless of enroll- 
ment group. The frequency with which staff meet- 
ings for the school paper and annual were scheduled 
increased as the enrollment increased. 

In Group I schools (less than 250 enrollment), 
most athletic activities were scheduled during a 
regular period. In Group П schools (enrollment of 
250-499), less than half of those visited reported 
athletic activities scheduled during a period within 
the regular school day. In Group Ш schools (en- 
rollment of 500-999) and Group IV schools (over 
1,000 enrollment), all administrators indicated that 
athletic activities were Scheduled after school. 
Large schools scheduled activities ata specific time 
more often than small schools. 

To provide for flexibi lity in the schedule, ap- 
proximately 97 percent of the visited schools ex- 
cused students from study hall for special activities 
or assignments. The next mostfrequently mention- 
ed method was that of excusing students from regu- 
lar classes, with approximately 70 percent of the 
schools following this practice at some time. Most 
administrators emphasized, however, that this 
practice was sparingly used. Fifty percent of the 
schools scheduled some classes for double periods 
to allow for flexibility. Limited use of a floating or 
of dovetailing periods was reported. 

The mean number of minutes devoted to various 
courses varied with the number of periods in the 
school day. For all schools using the six-period 
schedule, the mean number was 279 minutes per 
week. For schools using the seven-period schedule, 
the mean number was approximately 266 minutes. 
А mean number of approximately 236 minutes per 
Week was used in schools with an eight-period 
Schedule. For schools using the nine-period sched- 
ule, the mean number was approximately 220 min- 
utes per week. 

The only courses that were scheduled for more 
time as the number of periods in the schedule was 
increased were home economics, agriculture, and 
Shop. All other courses were scheduled for fewer 
minutes per week when the number of periods in- 
creased. 

Comparison of Classes per Teacher by Type of 
Schedule. Approximately . percent more 
Classes per teacher were offered in schools with 
an eight-period schedule than in schools with a six- 


period schedule. Ап average of 12.2 percent more 
classes per teacher wereoffered inschools employ- 
ing а seven-period schedule than in schools using a 
six-period schedule. Teachers in schools with an 
eight-period schedule taught an average of 8.2 per- 
cent more classes than teachers in schools witha 
seven-period schedule. 

When 1) teachers’ salaries are considered as the 
major item of cost of instruction, 2) the more clas- 
ses taught per teacher, the greater the economic 
advantage to the school district, 3) the length of pe- 
riod does not affect student achievement, and 4) the 
sizes of class sections would even out under differ- 
ent types of schedule when a large number of scools 
are included, some advantages are indicatedfor the 
seven- and eight- period schedules. 

With an advantage of 21.4 percent more classes 
per teacher with the eight-period schedule over the 
six-period schedule, aschool system with 20 teach- 
ers could offer 19 more sections per day. Ifthe av- 
erage salary paid to these twenty teachers was 
$5, 200 per year, the school district would realize 
21.4 percent more in teaching services for its mon- 
ey, which would amount to an $18, 300 advantage to 
the school district. 

The seven-period schedule can offer 12.2 per- 
cent more classes per teacher than can the six-pe- 
riod schedule. Using the preceding illustration, a 
School district could offer about 11 more classes 
per day or an $11, 400 advantage inteachers' salar- 
ies during the school year. With the comparison of 
the seven- and eight-period schedules, an advantage 
of 8.2 percent more classes are shown, in favor of 
the eight-period schedule. With 20 teachers ina 
school system, this would amountto a yearly advan- 
tage of $8, 800 to the school district. 

Teacher Load. Teachers inschools using an 
eight-period Schedule taught more classes and more 
students per week thanteachers іп schools with few- 
er periods. Approximately 5.4 classes were taught 
by teachers in an eight-period schedule. They also 
taught an average of 555 students per week, had a 
mean class size of 23, and supervised approximate- 
ly 0.8 study halls daily. This was morec lasses, 
more study halls, and more students per week than 
teachers with other types of schedules. 

By using a modified Douglass formula for deter- 
mining teacher load, it was found that there was an 
increase in the teaching load expressed in teaching 
units, with an increase in school enrollment for 
most subject fields. Teachers in English, soc ial 
Science, and other academic subjects tended to have 
a heavier load than teachers in agriculture, physi- 
cal education, and the like. However, there was 
some decrease in the number of teaching units with 
anincrease in the number of periods per day. A 
mean of 24.4 teaching units was the teacher load for 
schools with а six-period schedule, the teacher load 
was 23.7 units for schools with an eight-period 
Schedule. The size ofindividual class sections was 
not taken into account in thes e calculations. How- 


66 THE JOURNAL OF EDUCATIONAL RESEARCH 


ever, the total number of pupils per teacher per 
week was included indetermining the relative teach- 
er loads under different types of schedules. 

Opinions of Teachers. Criticisms by teachers 
of the schedules varied with the type of schedule 
used. Teachers with a six periodschedule most 
frequently criticized the size of classes, inflexibil- 
ity of the schedule, and interruptions of classes for 
activities. Teachers in a seven period schedule 
most frequently criticized the varying lengths of pe- 
riods, number of preparationsrequired by the 
teachers, and class size. Teachers in an eight-pe- 
riod schedule most frequently criticized the short 
length of the class periods. Thepercentage of 
teachers criticizing the various provisions in the 
Schedule, however, was small. 

The rating of the schedule also varied with the 
type of schedule employed. Teachers with a six- 
period schedule rated the length of the class periods 
and the length of the school day as more satisfactory 
than did teachers in an eight-period schedule. 
Teachers with an eight-period schedule rated the 
length of the class period and the num ber and size 
of study halls as less satisfactory than teachers in 
Schools that used less than eight periods a day. 


TABLEI 


Classroom Utilization. The use of rooms de- 
signed for typing, bookkeeping, agriculture, shop, 
music, science, and home economics as regular 
classrooms increased as the enrollment of the 
School increased. Classroom use ofthe auditorium 
or of rooms designed for study halls decreased as 
the enrollment of the school increased. Rooms de- 
signed for agriculture and music were used less 
frequently than most other rooms. 

Classrooms were morefrequently used for clas- 
ses in schools using the eight-period schedule than 
in schools using fewer periods per day. For exam- 
ple, Schools using the eight-period type of schedule 
had a mean of 4. 7 classes per room; schools using 
the seven-period schedule had a mean of 3. 9 classes 
per room. 

The use of classrooms also increased withthe 
enrollment of the schools. For example, the mean 
number of classes per room for schools of GroupI 
was approximately 3.1; for schools of Group II, the 
mean number of classes per room was approximate- 
ly 4.0. There was a slight dec rease to a mean of 
3.9 classes per room for schools in Group IIl. 
Schools of Group IV scheduled a mean of 4.4 classes 
per room. 


SUMMARY OF MEAN NUMBER OF MINUTES PER WEEK FOR ALL SUBJECTS, 
ACCORDING TO ENROLLMENT GROUP AND TYPE OF SCHEDULE 


Enrollment--Size Group 6-Period 7-Period 8-Period 9-Period 
0-249 211.19 263.80 235.56 219.66 
250-499 280.93 265.97 242. 69 216. 08 
500-999 277.48 269.01 250.35 232.70 
1000 and above 276. 63 267.14 226.52 217.36 
Mean 279.08 266.31 235.86 220.17 


— 
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TABLE II 


COMPARISON OF PERCENTAGE DIFFERENTIAL IN NUMBER OF CLASSES PER 
TEACHER BY TYPE OF SCHEDULE, BASED ON 938 SCHOOLS 
ARI —————— 


Percentage Advantage in Classes per Teacher of Different Schedules 


Types of Schedules Schedules by Enrollment Groups 
E - - ў Schools 
7 period vs. 6 period 15.78 10.18 5.57 8.73 9.35 
8 period vs. 7 period 10.39 11.87 12.48 17.51 12.34 
9 period vs. 8 period 13.96 8.01 8.64 -2.65 9.55 
8 period vs. 6 period 25.83 23.26 18.75 27.77 22.85 
9 period vs. 6 period 43.94 33.13 29.01 24.38 34. 58 
9 period vs. 7 period 25.80 20.83 22.20 14.39 23.08 
TABLE III 


SUMMARY OF TEACHER LOAD, BY TYPE OF SCHEDULE 
a AAAOLLLUOC. Е 


6 period 7 period 8 period 
Mean -N Average N Average N Average 
ИРЕН Nate ат роет о OE cepa AN a дара лке ЕДЫ 
“Number of Classes 407 4.47 225 5.01 117 5.42 
Number of st. Halls 410 0.45 227 0. 60 118 0.79 
Number of Students Weekly 415 540. 84 229 520. 56 117 554.83 
| Number of Classes plus St. Halls 4.93 5. 61 6.21 
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TABLE IV 
COMPARISON OF TEACHER LOAD IN PERCENT, BY TYPE OF SCHEDULE 


Item 8 period vs 6 period 8 period vs. 7 period 7 period vs. 6 period 


Number of Classes 21.39 8.16 12.23 

Number of Classes plus St. Halls 26.57 10. 68 12. 97 

Number of Students Taught Weekly 2.59 6. 58 -3. 90 
TABLE V 


MEAN NUMBER OF PERIODS SPECIAL CLASSROOMS ARE USED IN 
SCHOOLS, BY SCHEDULE 


Room 6 period 7 period 8 period 
No. Mean No. Mean No. Mean 


Typing 51 5.11 32 5.12 16 5.92 
Bookkeeping 47 4.80 21 4.50 14 5.11 
Agriculture 36 4.19 28 4.89 14 4.07 
Shop 51 4.78 26 4.74 17 5.40 
Music 51 4.74 31 5.06 18 4.57 
Science 52 4.81 31 5.19 19 6.05 
Home Economics 50 5.02 31 5.04 20 5.58 
Study Halls 49 6.14* 32 6.96 20 6.94 
Auditoriums 52 1.79 32 0.28 20 0.20 


* Includes some schools with six periods plus an activity period. 
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TABLE VI 


PERCENTAGE DIFFERENCES IN ROOM UTILIZATION BY SCHEDULE 
WITH SCHOOLS GROUPED BY ENROLLMENT 


“ЕССЕ 


Enrollment-Size Group 7 period vs. 6 period 8 period vs. 7 period 8 period vs. 6 period 


—— DNE EE. : v 3 


0-249 21.96 12.22 36.86 
250-499 -2.84 13.95 10.53 
500-999 20. 59 -15.38 4.28 
1000 and above -4.53 50. 63 44.97 


----- а-а ааа на т анат ИИ Ыы", TIRE 


TABLE ҮП 


Y 
COMPARISON OF SEVEN- AND EIGHT- PERIOD SCHEDULES ON ECONOM 
AND LOAD FACTORS IN 106 HIGH SCHOOLS 


Те of ils Weekl Total Load (Douglass) ^ Mean Classes per Teacher 
Ew — cem eum ES нег PY Difference No. % Difference No. % Difference 


Period 22 521 24.94 5.01 


Period 23 4.3 555 6.1 23.70 -4.9 5.42 8.16 
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Conceptions of "Ideal" Leadership in Accepted and Rejected 


Principal Training Candidates 


WITHIN THE PAST few years, new emphasis and 
direction have been givento research in the prob- 
lems of leadership. The Ohio State Leadership 
Studies which were initiated in 1945 have made im- 
portant contributions.tothis area, and the sever- 
al dimensions of leader behavior developed inthese 
studies are proving useful in research. The first 
dimension, "Consideration, ” refers to activities of 
the leader which promote harmony and in-group 
feeling. The second dimension, ‘‘Initiating Struc- 
ture," refers to leadership activities which point 
the way toward the goal and mark out the procedures 
to be used in making progress toward it. Through 
the use of appropriate measurements, a subject 
may report on these dimensions as they appear in 
his own leader behavior or as he sees them in the 
behavior of others; furthermore, he may report on 
“тезі” behavior (what is done) or ‘‘ideal’’ behavior 
(what should be done). 

These two dimensions of leader behavior have 
been explored in a wide variety of situations includ- 
ing a number involving school settings. 2 Halpin (3), 
for example, compared a group of superintendents 
and principals with one composed of aircraft com- 
manders, Each subject described his own “‘ideal’’ 
leadership behavior while several subordinates de- 
scribed his ‘‘real’? behavior. For both ‘‘ideal’’ and 
“real”? measurements, educational administrators 
were higher in Consideration, and aircraft com- 
manders were higher inInitiatingstructure. Hemp- 
hill (4) found that chairmen of college departments 
which have the best reputationfor good administra- 
tion were described as above average on both Con- 
Sideration and Initiating Structure. 

As part of a larger project designed to study 
leadership style among school executives, Seeman 
(5) had a group of teachers evaluate their superin- 
tendents in terms of Consideration, Initiating Struc- 
ture and effectiveness as school leaders. It was 
found that all three dimensions were positively cor- 
related; better leaders tended to be higher in both 


* Footnotes will be found at the end of the article. 
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Consideration and Initiating Structure than poorer 
ones. 

Research indicates that measures of Considera- 
tion and Initiating Structure canbe useful indices of 
leader behavior in a variety of school situations. 
The present study attemptedto investigate this 
problem further by comparing conceptions of “іде- 
al" leadership in accepted and rejected leadership 
training candidates. 


Method 


The subjects of the present study were 93 candi- 
dates for a leadership training program sponsored 
by the Department of Public Instruction of Hawaii 
and by the University of Hawaii. This program is 
designed primarily to train personnel for princip al- 
ships and vice-principalships in Islan dschools. 
Each candidate was administered a battery of tests 
including the Miller Analogies Test, Strong Voca- 
tional Interest Blank, Minnesota Teacher Attitude 
Inventory, Rokeach Dogmatism Scale, Rokeach and 
Eglash Intellectual Conviction Scale, Rorschach 
Inkblot Test (group administered and scor ed ac- 
cording to the Monroe Inspection Technique), Ed- 
wards Personal Preference Schedule, anda projec- 
tive biography blank. 

Actual acceptance and rejection of the candidates 
was performed by a selection board which consulted 
the results of the test battery along with a number 
of ratings and evaluations of each candidate. Of the 
93 candidates, 33 (22 males, 11 females) were ac- 
cepted for the training program and 60 (42 males, 
18 females) were rejected. 

For the purposes of the present research, the 
Leadership Opinion Questionnaire (LOQ) in slightly 
modified form was administered to the candidates 
along with the regular test battery. Developed by 
Fleishman (1, 2), the LOQ is a 40-item multiple- 
choice questionnaire which providesa simple and 
convenient measures of Construction and Initiating 
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Structure. In executing the questionnaire, the sub- 
jects were asked to describe their ‘‘ideal’’ leader- 
ship, that is, what they ‘‘should do" in each situa- 
tion described by the testitems. The results of the 
questionnaire were not considered by the selection 
board in its selection and rejection of the candidates. 


Results 


The data for the male and female subjects were 
combined after preliminary analysis indicated that 
there were no statistically significant differences 
between them. For the accepted and rejected 
groups, respectively, Consideration means were 
63.94 and 62.43 (t = 1.12), and Initiating Structure 
means were 42.15 and 44. 38 (t = 0.54); neither of 
these differences proved to be statistically signifi- 
cant. Variance scores of the accepted and rejected 
groups, respectively, for Consideration were 22.99 
and 43.84 (F = 1.91), and for Initiating Structure 
were 35.73 and 64. 30 (F = 1. 79); both of these dif- 
ferences were significant at the 5% level of confi- 
dence. These differences in disparation can be 
Seen from an inspection of the data as they are 
plotted in Figures 1 and 2. 


Discussion 


The similarity of means of the accepted and re- 
jected candidates on both Consideration and Initiat- 
ing Structure would seem to indicate that the two 
groups had similar conceptions of ''ideal" leader- 
Ship. However, significant differences in variance 
suggested an importand discrepancy between the 
groups. Specifically, the members of the accepted 
group were more in agreement among themselves, 
that is, they were more homogeneous in their con- 
ception of ‘‘ideal’’ leadership while the rejected 
group proved to be more heterogeneous. 

Rejected candidates tended to һауе more extreme 
conceptions of **ideal'' leadership. With reference 
to Consideration, some rejectees seem ed to con- 
ceive of the “‘ideal’? leader as one who was highly 
Solicitous of his followers; by contrast, other re- 
jectees pictured an ‘‘ideal’’ leader who was rela- 
tively indifferent to the feelings of his followers. 
With reference to Initiating Structure, some rejec- 
tees described an ‘‘ideal’’ leader whohighly organ- 
ized and controlled each situation, while others saw 
him as a person who would offer his group 1 ittle 

framework. 

By contrast, the accepted candidates more often 
proved to be moderate intheirconceptions. Accep- 
tees tended to take intermediate positions on both 

Consideration and Initiating Structure. They con- 
ceived of an ‘‘ideal’’ leader who combined consider- 
ation with distance and structure with freedom. 

The results suggest that agreement with a nor- 
mative ‘‘ideal’’ might be a fruitful index of leader- 


ship potential. In the present instance, at least, 
the selection procedures favored ‘‘middle-of-the- 
roaders and discriminated against those candidates 
who took extreme positions concerning “ideal” 
leadership. 


Summary 


The present investigation compared the concep- 
tions of ‘‘ideal’’ leadership held by accepted and 
rejected principal training candidates. Conceptions 
of **ideal" leadership were obtained through the use 
of the Leadership Opinion Questionnaire which de- 
scribes and measures such behavior in terms of 
two dimensions, Consideration and Initiating Struc- 
ture. It was found that the accepted and rejected 
candidates, on the average, had similar conceptions 
of ‘‘ideal’’ leadership, but that the former group 
was significantly more homogeneous in its concep- 
tions than the latter. It was concluded that agree- 
ment with a normative ‘‘ideal’’ might be a fruitful 
measure of leadership potential. 
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1. The authors wish to thank Dr. E. A. Fleishman 
for his kind assistance. 

2. Much of the literature relevant to the present 
Study has been assembled in Stogdill and 
Coons (6). 
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An Application of Levels of Concept Formation 


to Measurement of Vocabulary 


METHODS OF measuring vocabulary have re- 
ceived much attention from psychologists and ed- 
ucators alike, and several techniques have evolved 
for measuring one's ability to define words, Still 
some investigators (2) feel that current approaches 
provide only asuperficial estimate of the depth and 
breadth of the concept associated with the verbal 
symbol. A device which seeks out more than a single 
level of understanding would seem desirable there- 
fore in the measurement of vocabulary meanings 

In a study of the formation of concepts, Reichard, 
Schneider, and Rappaport (7) have identified three 
Sequential levels of concept development: (a) the con- 
crete, (b) the functional, and the (c) conceptual or 
abstract. Severalattempts have been made to utilize 
this three-level sequence in increasing the precision 
in measurementof vocabulary items in intelligence 
tests, butonly moderate success has been achieved. 
In the initial effort, Gerstein (4) constructeda 
multiple choice test following the pattern proposed 
by Reichard, Schneider, and Rappaport, using the 
vocabulary scale in the Wechsler Bellevue examin- 
ation. For every stimulus word she provided one 
alternative representing each of the three levels of 
conceptformation. Subjects were instructed to sel- 
ect the best definition of the term, the hypothesis 
being that persons operating on alevelof concretistic 
Concepts would select the concrete description of the 
Stimulus word, andsoon through all three levels of 
Concept formation. Gerstein’s results illustrated a 
definite trend in the hypothesized direction, but the 
utility of the three-level approach to vocabulary 
Measurement was not convincingly demonstrated. 

Using a similar design, Stacey and his associated 
(9,10,11) have also attempted to construct group 
tests with Wechsler’s vocabulary items. These ef- 
forts, too, have failed to provide clear-cut evidence 
which would support the three-level approach as a 
means of evaluating general vocabulary development. 

In a study of agechangesin concepts, Feifel and 
Lorge (3) found qualitative differences between con- 
Cepts of young and older children. Following these 
findings, Kruglov (5) attempted to fit Gerstein’s 
technique to the measurement of age changes in de- 
finitions of vocabulary items from the Stanford-Binet 
Although her test failed to distinguish significantly 
betweenher five age groups, the trend was again in 
the proposed direction. 
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Dolch and Leeds (2) have also used a multiple re- 
sponse approach to improve vocabulary mea sure- 
ments, but their effort was toward extending the 
Scope of items rather than testing a three-level ap- 
proach. Similarly, Schaaf (8), Sueltz (12), and 
Brownell (1) have attempted to advance the precision 
of measuring concepts in specific academic areas, 
but none of these writers followed the progression 
of levels described by Reichard, Schnieder, and 
Rappaport. Therefore, these studies must be con- 
sidered peripheral to the present investigation. 

The body of research noted above has not entirely 
demonstrated utility in the three levels of concept 
formation as a basis for constructing tests of one's 
ability to define words: nevertheless, throughout 
these studies the consistency of trend alone may be 
taken as some support for the adequacy of the method. 
it appears that the technique provides at leastan 
operational description of response types, if not 
destinctly separate catagoriesin all cases, Further 
study of the procedure seems justified. 

In the investigations utilizing the three-level ap- 
proach to vocabulary measurement, the effort has 
been toward extending the application of sections 
of intelligence tests. If the procedure described by 
Gersteincan be presumed to increase the precision 
of vocabulary measurement as found in tests of in- 
tellectual ability, the method mayalso be expected 
to provide a more precise evaluation of vocabulary 
in specific academic areas, e.g., science, geogra- 
phy, arithmetic. Thus, the purpose of the present 
study wasto test the applicability of Gerstein's three- 
level technique to the measurement of vocabulary 
specific to a school subject, in the present case, 
arithmetic. 


Procedure 


The initial task was the constructionof a multiple- 
choice test of arithmetic vocabulary which would be 
consistent with Gerstein's three-level h y pothesis. 
To this end, words were selected from Marsten' s 
list (6), approximately one-fo urth of the terms 
coming from those appearing in the upper quarter in 
frequency in the combined fourth, fifth, and sixth 
grade levels; one-half of the words comingfrom 
those which appeared medially infrequency; and one- 
fourth coming from the least frequently oc curing 
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quarter of Marsten's count. No claim for order of 
difficulty is proposed for a list complied on the basis 
of how often words have appeared to students, al- 
though some relationship may exist between the ex- 
tent of familiarity a student might have with a word 
and his frequency of encounter with it. The proced- 
ure was proposed to bring variety of terms into the 
list rather than to provide graded difficulty. Twenty- 
five words were selected for the trial test. After the 
terms were chosen, the next step was to build the 
alternative responses for each item. Four defini- 
tions were constructed for each word--one defining 
the term withaconcrete example, one definition re- 
presenting the functional or usage level, and a third 
on anabstractorconceptuallevel. The fourth alter- 
native -- а distracter -- was not appropriate to the 
term. All words used in building response choices 
were checked against the Thorndike-Lorge list to be 
consistent with the vocabulary level of the age group 
for which the test was designed. The following ex- 
ample illustrates the nature of the test items. 


Quotient: 


(a) in the problem, 842-4, the quotient is the 4 

(b) in a multiplication problem the multiplier is 
called the quotient 

(c) the result of dividing one number by another 

(d) when you do a division problem and get an 
answer, you call it the quotient 


Students were instructed that more than one answer 
may seem correct, and that they should choose the 
one answer that they believed was the best definition 
of the term. 

After the author had constructed the multiple re- 
sponses, acolleague edited the material for the pur- 
pose of fitting the alternatives more closely to the 
prescribedlevelsof concept formationas utilized by 
Gerstein. At this point the responses for each item 
were arranged in random order by use of a table of 
random numbers. Thedevice was then administered 
for the purpose of evaluating items. 

The first analysis of the test was based on data 
collected from twenty-six fifth gradestudents in an 
urban school in northern California, who took the 
test in the Spring of the year. Three groups of sub- 
jects were arranged as follows. Persons whose 
papers revealed a plurality of abstract response 
selections were classified as ‘thigh scorers’’, those 
students showing a concentration on functional re- 
sponses were called ‘‘middle scorers” and the ones 
who chose a bulk of concrete alternatives, ** low 
всогегв.”” Then tabulations were made showing the 

number of times each response type was chosen by 
each group for each item, the hypothesis being that 
the ‘thigh scorers’’ should choose the abstract de- 
finition for every term more often than other alter- 
natives, for the ‘‘middle scorers” the functional 
response should show the greatest frequency of se- 
lections, and forthelow scorers the concrete choice 


should be most often chosen. Items which did not 
essentially conform to this structure were elimin- 
ated or revised. Sixteen items now remained to be 
used in subsequent administration of the test. 

For thenext analysis of the device a second sam- 
ple of students was obtained. This time 141 sixth- 
grade pupils took the test in the Fall of the year. 
These subjects were from two elementary schools 
in an urban area adjacent to the one from which the 
first sample of students had been chosen. 

The results of this second trial of thetest were 
analyzed asfollows. First, students were classified 
on the basis of the response type from which they 
made a pluralityoftheir choices. Those who select- 
ed more concrete alternatives than any other type 
were called ‘‘concretistically oriented?’ those who 
concentrated on functional responses were classi- 
fied as ‘functionally oriented’’ and those who chose 
more conceptual responses than other types were 
labeled as ‘‘conceptually oriented. ’’ In this manner 
three groups again were formed asin the original 
trial of the test. There were thirty-five subjects in 
the conceptually oriented group, fifty-nine function- 
ally oriented, and forty-seven concretistically or- 
iented. 

At the time of administering the experimental 
testa second test of arithmetic concepts, the ** Fund- 
amental Knowledge and Vocabulary’’ section from 
the arithmetic battery inthe Iowa Every-Pupil Tests 
of Basic Skills, was also administered for the pur- 
pose of establishing а criterion. Then a distribution 
of scores on the criterion test was made for each of 
the three groups formed on the basis of the experi- 
mental examination. 

Bartlett’s testfor homogeneity of yariance 154 ю 
the rejection of the hypothesis that o1 2-02 2-03 ^, so 
the t- ratio was used to test the significance of dif- 
ferences between mean criterion scores for the three 
groups. 


Results 


As noted above, three groups of subjects were 
formed on the basis of the experimental test. Dis- 
tributions of their criterion scores provided the de- 
scriptive data produced in Table I. Here it can be 
seen that the mean scores increased with each con- 
cept level, moving from the concretistic group to 
the conceptually oriented group. Initially the hypo- 
thesis was tested that there was no difference be- 
tween mean criterion scores for the concretistically 
oriented and the functionally oriented subjects. The 
resulting t was 2.07. Since the five percent level 
had been established as the point at which the hypo- 
thesis would be rejected, thist reveals a significant 
difference between the means of these two groups. 

Next, the same procedure was applied to testing 
the null hypothesis forthe mean criterion scores of 
the functionally oriented and the conceptually orient- 
ed groups. Inthis case the t was found to be 2. 18, 
again revealing a significant difference at the five 
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TABLE I 


CRITERION DATA FOR THE THREE EXPERIMENTAL GROUPS 


Conceptually Functionally Concretistically 
oriented oriented oriented 
Mean 15.20 12.23 10. 51 
Standard 
Deviation 7.14 4.89 3.65 
iu 2.18 2.07 


------------------------------------ 


percent lev el. 

Further analysis of the test was desired, but 
Such extension depended upon devisinga Scoring 
Scheme which would provide weights for each of the 
three typesof responses. Therefore, these alterna- 
tive choices were weighted in proportion to the re- 
lative magnitude of the mean scores on the criterion 
test achieved by each of the three groups of subjects 
as noted in Table І. By this procedure the concrete 
responses received one point, the functional re- 
sponses were given 1.2 points, the conceptual alter- 
natives 1.5 points. This weighting scheme would 
appear tobe superior to a strictly arbitrary assign- 
ment of values, although it would be even more de- 
fensible if evidence could be provided to showthe 
experimental test to be a ratio scale. Such evidence 
is, however, not available. 

By means of the above scoring technique a total 
Score was computed for each subject on the vocabu- 
lary examination. These figures were then correlated 
with other available test data. The resulting coeffi- 
cients were: with PMA-Verbal .31, with PMA-Num- 
ber ,29, with computation from the Iowa Every-Pupil 
Tests of Basic Skills .20, and .43 with problems 
from this Iowa test. It appears that the experimental 
scale is only moderately related to intellectual fac- 
tors and academic skills, although the size of the co- 
efficients would no doubt be greater with an expanded 
range of ability. 

One additional step in the test analysis seemed 
appropriate - the computation of reliability. The test 
was split on an odd-even basis, and separate scores 
were compiled for each half, using the weighting 
scheme noted above. Correlating these data, and 
using the Spearman-Brown correction, produced a 
reliability coefficient of . 76. 


Discussion and Conclusions 


If children pass through a sequence of levels in 


developing concepts associated with the material: 
presented by Reichard, Schneider, and Rappaport 
it may be hypothesized that the same sequence 
would appear in the development of concepts with 
other materials. Providing that this hypothesis i: 
true, one should be able to measure concept form- 
ation by using a tool whichdelineates several levels 
of advancement in the construction of these concepts. 
The results of the present study appear to support 
this contention, at least to the extent that children 
Showing apreference for a given level of responses 
evidence a corresponding degree of achievement or 
outside measure of concepts in arithmetic. Subjects 
who preferred the abstract alternatives on the ex- 
perimental test achieved significantly higher scores 
on the criterion than persons who concentrated on 
functional responses on the experimental device. 
Likewise, these functionally oriented pupils produc- 
ed significantly higher scores on the criterion thar 
did subjects who preferredthe concrete alternatives. 
Thus, on the practical level, ateacher using a test 
such asthe one here being evaluated could determine 
the stage of concept formation at which the students 
were operating, and subsequently choose materials 
and adapt activities in accordance with the extentto 
which the concepts of the class have progressed. 


From the results of this study one may conclude 
that students illustrate a preference for definitions 
at one level of concept formation over other stages 
the preferred level corresponding with progress її 
the development of concepts. Therefore, it appear: 
that the three-level approach to m ea surement oí 
concepts associated withverbal symbols has poten: 
tial utility as a means of evaluating students' vo. 
cabulariesin elementary school arithmetic. It may 
also be hypothized that this approach could beo! 
value in measuring the status of children's concept 
associated with terms specific to other academic 
areas as well. 
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The Comprehension of Prose Materials by College Freshmen 


When Read Silently and When Read Aloud 


Introduction 


LOGICAL REASONING suggests that oral 
reading should have certain important advantages 
over silent reading sinceitis the method which re- 
turns the skeletal written wordtoitsoriginal, more 
complex form, with the helpful modifiers of sound 
and gesture. Modern educational theory supports 
the idea that the more complete the experience, the 
more learning there will be. Yet, early research 
contradicted this in one important aspect: It was 
Shown that good silent readers could go faster, and 
that fast silent readers made higher scores when 
tested for comprehension of factual content.. Since 
Speed and comprehension are important factors in 
our modern world, schools readily shifted their 
emphasis from oral to speedy silent reading. It 
Seems clear that this was a good thing, especially 
for materials of average difficulty wherein compre- 
hension of the intellectual content was impor tant. 
However, this approach to reading has failed to take 
into serious considerationthe potential advantage of 
the oral reading (sometimes merely asortof crude 
muttering) that man often uses whenhe is presented 
With a need to comprehend complex patterns of 
thought in prose or poetry. 


Problem 
Statement of the Problem 


The general purpose of this study was to in- 
vestigate possible differences in amount of compre- 
hension for a given period of time between the oral 
reading and silent reading of certain prose mate- 
rials, The problems ofthis study were to deter- 
mine: 


1. When read silently and when read aloud does 
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either manner of reading each of the seven 
selections of prose material produce signif- 
icantly higher comprehension scores at any 
of seven levels of difficulty? 

2, When read silentlyand whenread aloud does 
either manner of reading the selections pro- 
duce significantly higher comprehension 
scores when the scores for all seven levels 
are combined? 


Definitions of Terms Used 


The following definitions of terms were ap- 
plicable within the limits and scope of the study and 
are not necessarily general definitions, 


Prose materials 


Prose materials indicated seven selections 
of mimeographed prose which were prepared from 
radio feature copy and were as factual and non- 
emotional as possible. Each selection was a story 
that constituted a sense-making whole and was 
ranked in difficulty according tothe Flesch formula 
of readability, 


Silent Reading 


In this study silent reading was defined to 
include whatever type of silent reading the subjects 
did--^ lip" reading, sub-vocal reading, or other. 


Reading aloud 


Inthis study reading aloud was defined as 
the oral reading of the prose selections in such 
a manner as to be more or less comprehensible to 
the reader himself, It was unconcerned with vocal 
projection and diction. 
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Procedure 
Selection of Test Materials 


Stories which were devised by Cartier, Good- 
man-Malamuth, and Harwoodl* and rated by them 
according to the Flesch reading ease formula2 were 
Selected. Тһе seven stories represented seven lev- 
els of reading difficulty, rangingfromgrade school 
level to upper division college level. Allthe stories 
were rated as ‘‘interesting’’ according to the Flesch 
human interest formula. Each story was followed 
by a test which consisted of 15 questions of informa- 
tion in the story. Each question was designed as a 
multiple-choice question with 5 possible responses, 
The answer sheet was constructed so that it con- 
sisted of one page and was separate from the test 
booklet. Each answer was representedon the an- 
swer sheet by its corresponding number in the test. 
Each story, typewritten in elite type, occupied one 
page, was double-spaced, and contained 300 words. 


Selection of Subjects 


Of the total 60 subjects in the experiment, 
30 performed the oral reading experiment and 30 
performed the silent reading experiment. The oral 
and silent reading groups were matched as follows: 
(a) The subjects were students from Speech 2A 
Classes at San Jose City College. (b) All the sub- 
jects were second semester freshmen. (c) None 
were foreign students or hada bilingual background, 
(d) The oral readers were not Significantly differ- 
ent from the silent readers (at the 5 per cent level 
of confidence) in the distribution of age, Sex, raw 
Scores on Schools and Colleges Aptitude Test (sec- 
tions: verbal, quantitative, and totals),and raw 
Scores on Stanford Achievement Test (sections: 
word comprehension and paragraph comprehension). 


Testing Procedure 


The 60 subjects were divided into two groups 
of 30 each. Each subject of the oral reading group 
was tested individually, and each Subject of the si- 
lent reading group was tested individually. The 
Subjects were all tested in the recording room of 
the music building at San Jose City College. This 
Sound-proof room was located between the bandroom 
and the choir studio. The recording room had a 
double glass studio-type observation window built in- 
to the wall facing the choir studio. The Choir studio 
was selected as the observation room. This studio 
had two 12-inch speakers, built into the front wall 
which were connected to a wall plug inthe recording 
room. This technical arrangement and the window 
made it possible for the investigator to listento and 

observe the oral and silent reading subjects in the 
recording room without being seen by them. The 
SS ШЫ Со нулы 

* АП footnotes will be found at end of article. 


Subject sat with his back to the investigator. The 
recording room temperature was set constant at 72 
degrees Fahrenheit, and the operation of the venti- 
lation system was inaudible. 

The reading tests were scheduled into three 
32-minute time blocks, allowing 4 minutes for in- 
Structions before actual testing began and 28 min- 
utes for the actual reading experiment. АП reading 
tests weregivenduringthe 10:25 A. M. to 12:02 P, M. 
period on school days only. Four minutes were al- 
lowed for reading and testing of each story for both 
the oral and silent readers. А cue tape in there- 
cording room gave the necessary time signals, The 
investigator spent the four-minute instruction peri- 
od with each subject, thenwentto the adjacent studio 
to observe, unnoticed by the subject for any infrac- 
tions of the test instructions. Threesubjects were 
tested per day (Saturday and Sunday excluded) which 
totaled 15 per week, The length of the experiment 
consisted of a four-week period. 


Treatment of Raw Data 


The raw scores of the tests were obtained 
Írom the number of correctly answered questions re- 
corded on the answer sheet by the subject in the form 
of a circle drawn around an appropriate number, In 
manipulating the data gathered by the foregoing pro- 
cedures, the principle statistical technique was the 
Significance of mean differences. After applying the 
procedures described, the resulting data were tab- 
ulated into two main groupings: (1) the total scores 
for each of the seven stories, and (2) the total score 
from the series of stories combined. 


Findings 
Problem 1 


The first problem of this study was to com- 
pare the silent reading scores for each story with 
the oral reading scores, The statistical results for 
each story are shown in TableI. Although the com- 
prehension of the prose material was greater in the 
oral presentation according to the difference between 
the means, only three of these differences were 
*'significant'' beyond the five per cent level of con- 
fidence. 


Problem 2 


The second problem of this study was to com- 
pare the total silent reading scores of the combined 
Stories with the total oral reading scores of the 
combined stories. There were two purposes in this: 
(1) to attempt to find whether there is a significant 
difference in comprehension of prose materials re- 
gardless of level of difficulty when read silently ог 
aloud, and (2) by combining all the test scores, to 


COLLINS 


TABLE I 


MEANS, MEAN DIFFERENCE, STANDARD ERROR OF THE DIFFERENCE 
AND T-RATIO OF THE MEAN OF THE DIFFERENCES FOR EACH STORY 


mE EFIE IMEEM É—— NN 
Predicted 


Story 55 Kind of d SEd t t t 
No. D Reading мк м M o ure eed ie) 
aa I RH DR RP шых шыл: 
7 9511 Oral 11.7) 

Very Easy Silent 10.9) .8 .37 2.16 2.00 2.66 
1 85.22 Oral 10.4 

Easy Silent Eee 1.2 . 46 2.61 2.00 2.66 
6 72.2 Oral 11.4) 

Fairly Silent 11:0) 14 .54 0.74 2.00 2.66 

Easy 

2 65.04 Oral 9.9) 

Standard Silent 9.5) -4 .48 0.83 2.00 2.66 
5 54.97 Oral 11.8) 

Fairly Silent 10.3) 155 .54 AATE "ANB 00347527866 

Diffi- 

cult 

3 39,90 Oral 9.9) 

ehe etu 9.0) .9 .56 1.6] 2.00 2.66 
4 15.08 Oral 9. 0) 

Very Silent 8.0 1.0 .63 159 2.00 2.66 

Difficult 


* N for each mean was 30 


TABLE П 


MEANS, MEAN DIFFERENCE, STANDARD ERROR OF THE 
DIFFERENCE, AND T-RATIO OF THE MEAN OF THE DIF- 
FERENCE FOR THE TOTALS OF ALL THE STORIES 


Kind of Reading M dy Edm o о 777900 

н AE TODO CORR RN Й зы eI ыссы ИБАЛЫ ША 
і 74.5 

DATEN P abo quo woo таноб нов 

Silent Reading 68. 6) 


Bue erue Тос coo Жк „шешу изле ы ш TU canis uci 
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eliminate any dependence upon the reliability or va- 
lidity of the Flesch readability formula. The sta- 
tistical results are shown in Table П. The differ- 
ence between the means in favor of the oral reading 
method was ‘‘very significant, " well beyond the. 01 
level of confidence. 


Interpretation 


An interesting pattern was noted inthe over- 
all reading ease range from ‘‘very easy" to “уегу 
difficult" in the t-ratios of this experiment. The 
stories rated ‘‘easy’’ and “уегу easy" were rated 
significant in favor of oral reading, Of the three 
Stories at the difficult end of thepattern one was 
rated very significant (number five) while the other 
two (numbers three and four) were rated insignifi- 
cant in favor of oral reading but were on a much 
higher (about the ten per cent level of confidence) 
level of confidence than the two stories (numbers six 
and two) at the center of the pattern which had a low 
level of confidence. It appeared that the oral readers 
did much better on the “еаѕу’’ and ‘‘difficult’’ ma- 
terials and only slightly better at the middle of the 
difficulty range, in comparison with the silent read- 
ers. Perhaps oral reading forces closer attention 
to “еазу” materials, and the vocal aspect could add 
interest itself, allowing for a more meaningful ex- 
perience for the reader. The added motor and au- 
ral stimuli could account for the better comprehen- 
sion of the ‘‘difficult’’ materials. This addition of 
stimuli could make the reading a more real experi- 
ence and force attention which might otherwise wan- 
der. 

The stories (numbers six and two) at the mid- 
dle of this pattern showed a sharp drop int-ratio as 
compared to the t-ratios of the stories oneither side 
in the range of difficulty. Perhaps the Flesch read- 
ability formula was not reliable at these points, This 
possibility motivated the construction of a predict- 
ability table to determine the accuracy of the Flesch 
formula in this experiment. The tabulation showed 
a three-to-one accuracy ratio in favor of the formu- 
la. This ratio was found for both oral and silent 
reading groups. Thus, there was no complete con- 
fidence that the Flesch formula truly measured dif- 
ferences in reading ease, but this margin of accu- 
racy (three-to-one ratio) seemed to justify the use 
of the formula as a tool for gross measurement in 
this experiment. 


Conclusions 


When seven prose selections representing 
seven levels of reading ease were read silently by 
one group, andorally by another group, and compre- 
hension tests weretaken, the following mean differ- 
ences between test scores were found: 


1. On the “уегу easy” selection the scores by 
oral readers were significantly higher. 
2. On the ‘‘easy’’ selection the scores by oral 


readers were significantly higher. 

3. On the “‘fairly easy” selection the scores 
by oral readers were insignificantly higher, 

4. On the ‘‘standard’’ selection the scores by 
oral readers were insignificantly higher. 

5. On the ‘‘fairly difficult’’ selection the scores 
by oralreaders were very significantly high- 
er. 

6. On the *'difficult" selection the scores by 
oral readers were insignificantly higher. 

7. On the ‘‘very difficult"! selection the scores 
by oral readers were insignificantly higher. 


When the scores onallseven comprehension 
tests were combined, and the two methods of read- 
ing (orally and silently) were compared, the follow- 
ing mean difference between total test scores was 
found: the total score by oralreaders was very sig- 
nificantly higher. 


Educational Implication 


This study found a very significant difference 
between total oral and silent reading test scores. 
This finding demonstrated sufficient statistical sig- 
nificance in favor of oral reading to merit reconsid- 
eration of the potential advantage of oral reading in 
comprehension of prose. It could be stated that the 
added motor and aural stimuli afforded the oral 
reader not apparently available to the silent reader 
may explain why the statistical results favored the 
oralreaders. Therefore, especially since these 
findings are independent of the reliability or validity 
of the Flesch formula, it could be implied that oral 
reading should be given more attention in reading 
instruction where comprehension is a problem. 


FOOTNOTES 


1. Francis A. Cartier, “Ап Experimental Com- 
parison of 'Human Interest! Factors on Lis~ 
tenability’’ (unpublished Doctoral dissertation, 
The University of Southern California, Los 
Angeles, 1951), pp. 102-116; Leo Goodman- 
Malamuth, *AnExperimental Study of the Ef- 
fects of Rate of Speaking Upon Listenability”’ 
(unpublished Doctoral dissertation, The Uni- 
versity of Southern California, Los Angeles, 
1956), pp. 95-109; and Kenneth A. Harwood, 
“Ап Experimental Comparison of Listening 
Comprehensibility with Reading Comprehensi- 
bility” (unpublished Doctoral dissertation, The 
University of Southern California, Los Ange- 
les, 1950), pp. 103-131. 

2. Rudolph Flesch, “А new Readability Yard- 
Stick, ’’ Journal of Applied Psychology, : 
228-230, June, 1948). 


3. Ibid., p. 230. 
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The Relationship of Heston Personal Adjustment Inventory Scores 


To Other Measures Commonly Employed in Counseling 


MEASURES OF personality аге commonly 
employed in batteries of tests which counselors use 
in their advisement of students. Many personality 
inventories are difficult to interpret. Тһе Heston 
Personal Adjustment Inventory (1) has enjoyed some 
popularity because it is fairly easy to administer and 
the results can be interpreted with ease. The inven- 
tory consists of 270 items which are de signedto 
yield scores on six personality traits: analytical 
thinking, sociability, emotional stability, con- 
fidence, personal relations and home satisfaction. 
Subjects are requested to answer yes or no to ques- 
tions pertaining to these areas and the results are 
profiled for counseling purposes. 

Despite the length of time the inventory has 
been available (1949), few studies have been pub- 
lished regarding its validity or usefulness. The 
writer found only one study which reported the: use 
of the instrument. Michaelis and Tyler (2) reported 
correlations among subscales which were similar to 
those found by the author of the inventory and also 
concluded that the instrument might be useful for 
diagnostic purposes. The presentstudy is designed 
to verify the reported relationships among the sub- 
Scales of the inventory and to determine the degree 
to which scores on the inventory correlate with 
measures of interest, mental ability, and reading 
achievement. 


Procedure 


Student personnel records of the guidance 
department of the Collegeof Education were em- 
ployed to select a sample of students who had been 
Counseled during their enrollment in the university. 
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It should be noted that these records were deemed 
to be typical of the student population of the college, 
and did not necessarily typify students with special 
adjustment problems. The first professional 
course in which prospective teachers enroll while 
at the university combines a program of orientation 
to the teaching profession with a guidance program 
of self-evaluation designed to help students deter- 
mine their suitability for teaching. As part of this 
course students take various tests and are counseled 
regarding their meaning. A randomly selected 
sample consisting of 84 female students was selected 
from this source for the purpose of study. АП stu- 
dents in the sample had been administered the 
Kuder Preference Record, California Test of Men- 
tal Maturity and the Progressive Reading Test as 
well as the Heston Personal Adjustment Inventory. 

In analysing the data, means and standard de- 
viations on the Heston were obtained and the inter- 
correlations among the subscales determined. 
These results are shown in Table I. 

The second phase of the study was to deter- 
mine the relationship of the various subscales of the 
Heston with the other measures. These relation- 
ships are shown in Table II. 


Results and Conclusions 


The correlations obtained among the six sub- 
scales of the Heston Personality Adjustment Inven- 
tory generally substantiated those reported by 
Heston, Michaelis and Tyler. Significant positive 
coefficients were obtained between the emotional 
stability scale and the confidence, personal rela- 
tions, and home satisfaction scales. Similarly 


TABLE I 


INTERCORRELATIONS OF THE SUBSCALES OF THE 
HESTON PERSONAL ADJUSTMENT INVENTORY 


(n - 84) 
eee 
Analyt. Emotional Confidence Personal Home 
Thinking Sociability Stability Relationships Satisfaction 
ee 
Analytical Thinking *+ .248 + .126 *+ .246 + .097 - .124 
Sociability ж. .212 *+ .520 *+ .384 + .143 
Emotional Stability жы .618 *+ .545 жы .257 
Confidence ж. .578 ж. .255 
Personal Relations * .133 
Home Satisfaction 
Mean 23.21 30. 36 25.17 26.36 25.94 40.39 
S. D. 5.71 7.07 7.55 7.86 5.44 8.53 


ж Significant at . 01 level of confidence 
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TABLE П 


CORRELATIONS BETWEEN HESTON SUBSCALES AND OTHER SELECTED MEASURES 


Kuder Preference Record 


Cal. Ment. Maturity 


Non- 
Lang. Lang. Total 


Progressive Reading 


Scien- Persua- Artis- Musi- Soc. Cleri- Out- 
Heston Mech. Comp. tific sive tic Lit. cal Serv, cal door 


Vocab. 


Total 


Analytical Thinking ^ 4.088 -.161 +.084 -.092 1.238 +.136 -.047 +.031 7.499 +.221 1.236 1.203 1.257 4.065 1.207 4.164 


.230 -.173 1.428 4.004 -.081 4.158 4.087 -.169 -.173 


1% 


Sociability -.151 


-.022 -.016 | -.032 4.047 -.001 


Emotional Stability 5.2027 -.147 1.221 -.102 4.047 -.037 -.160 4.061 -.140 4.109 4.014 4.026 | -.010 4.144 4.088 


Confidence 4.011 -.083 4.106 4.066 4.042 4.069 -.006 4.119 = 197 4.017 4.033 -.040 | +.060 4.159 4.120 


Personal Social. +.071 * 198 -.026 4.054 4.013 4.089 -.132 4.067 -.101 +.044 4.070 +.010 | 4.034 4.173 24.129 


Home Satisfaction 4.130 4.079 4.117 4.188 5322 -.114 -.162 1.324 -.1Ю -.032 | -.088 7.188 4.107 -.162 4.156 
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Significant positive coefficients were obtained be- 
tween the sociability scale and the emotionalsta- 
bility, confidence and personal relation scale. The 
magnitude of most of the obtained coefficients 
approximate those previously reported. А пойсе- 
able discrepancy between these data and previous 
Studies is the correlation betweenthe analytical 
thinking scale and the sociability and confidence 
Scales. These relationships may be due to the com- 
paratively low mear score attained by the group on 
the analytical thinking scale. 

The correlations found among the subscales 
are of sufficient magnitude so that the user of the 
inventory may desire to question the nature ofthe 
traits which are presumably being measured. 
It seems likely that several of the scales may be 
measuring manifestations of a single component or 
reflecting some general aspect of a personality syn- 
drome. Оп the other hand, the Hestonappears to 


appraise characteristics sufficiently different from 
those appraised by the Kuder Preference Record, 
the California Mental Maturity Test and the Pro- 
gressive Reading Test as to warrant its inclusionin 
a test battery designed for general advisement pur- 
poses. In addition, the items ofthe Heston have the 
advantage of being so worded that they appear 
to have face validity. 
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An Experimental Study of the Self-concept 


of Student Teachers * 


SELECTIVE RECRUITMENT is perhaps the sin- 
gle most important problem in teacher education to- 
day. A review of the current research andwriting 
in the field of education reveals wide areas of dis- 
agreement as to the factors that contribute to effec- 
tive teaching. There appears, however, to be a 
growing consensus among the authorities in the field 
that personality organization is one of the mostim- 
portant elements in determining the behavior of the 
teacher in the classroom. Lack of agreement as to 
how these personality factors may be identified and 
measured still presents a problem. Few devices 
currently on the market have proved adequate. 


The Self-Concept Theory of Personality 


One of the newer andseemingly more fruitful ap- 
proaches to the understanding of personality is the 
observation of the human being from the point of 
view of the behaver himself. This frame of refer- 
ence is in sharp contrast to the frame of reference 
usually used in psychology in which behavior is ob- 
Served from the outside or from the observer's 
point of view. According to the self-concept theory 
of personality, an individual’s behavior is deter- 
mined by the perceptions he has of himself as an 
individual and of the world around him. 

Adequately functioning personalities see them- 
Selves in essentially positive ways. They assume 
that they are persons that are liked, wanted, and 
valued for their own sakes, They become self-con- 
fident, self-assured, self-reliant m em bers of so- 
Ciety. Self-depreciation results in a falling off of 
effective functioning. Combs (1)feels that thou- 
Sands of people in our society are victims of their 
own self-concepts. Seeing themselves as inade- 
quate, they perform inadequately. 

That self-acceptance is a necessary character- 
istic for good mental health has been supported by 
a number of therapists. Carl Rogers (5) reports 
an observation which he feels he has made repeat- 
edly in client-centered therapeutic situations. He 
has found that whenever changes occur in the per- 
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ception of the self and in the perception of reality, 
changes occur in behavior. In therapy these per- 
ceptual changes are often more concerned with the 
self than with the external world. As treatment is 
directed toward a change of the concept of the self, 
the behavior is then changed. 

Research by Raimy, evaluating outcomes of 
therapeutic counseling, has revealed shifts of atti- 
tude in the successfully counseled cases to that of 
self-approval. From this study Raimy concludes 
that ‘‘what a person believes about himself is a gen- 
erally accepted factor in the social comprehension 
of others. The self-concépt theory predicated that 
each individual's perceptionofhimselfisof ultimate 
psychological significance in organized behavior. 
We act in accordance with our perceptions. ”” (4) 


The Problem 


Since mental hygienists and clinicians have re- 
peatedly demonstrated the і m portance of self-ac- 
ceptance for emotional maturity and mental health, 
then it is important for educators to become inter- 
ested in the problem of the teacher's attitude toward 
himself as a factor in the growth of a healthy per- 
sonality. Teachers must learnto accept themselves 
if they are to understand children and help children 
to learn desirable attitudes of self-acceptance. 

This study has beendesigned to measure the 
concept which the individual has of himself asa 
member of a social group and to determine the re- 
lationship between his self-concept and succ essful 
performance in student teaching. 


The Selection of Subjects 


АП of the students of Northern State Teachers 
College entering the part-time studentteaching pro- 
gram in the elementary grades during the school 
term 1958-1959 were rated independently by four 
trained observers as to their student teaching suc- 
cess. Preliminary judgments were first recorded 
on a detailed descriptive ratingscale regularly used 


* Adapted from a Ph. D. thesis completed at the University of Denver. 
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at the College to evaluate student teachers. For 
purposes of this study, this scale served primarily 
to help observers focus on all the various charac- 
teristics that should enter into an over-all assess- 
ment of student teaching success. 

An additional scale consisting of a single dimen- 
Sion, successful-unsuccessful, was devised to serve 
as a summary of the over-all performance ofthe 
student teacher. Іп an attempt to make differences 
between competences of teachers stand out as clear- 
ly as possible, ten intervals were chosen for this 
rating scale. Each student was ratedby the two co- 
operating teachers under whom he worked in the 
public schools over a period of two quarters and by 
the two college supervisors who visitedhim several 
times during each quarter. Thus each student re- 
ceived a total of four ratings; it was the average of 
these four ratings that served as the criterion of 
student teaching success in this study. 

The mean rating assigned by college supervisors 
was 6.67 with a standard deviation of 2.18. Тһе 
mean rating given by cooperating teachers was also 
6. 67 with a standard deviation of 2.10. The coeffi- 
cient of correlation between ratings of the college 
supervisors and the cooperating teachers was .77. 

On the basis of the average of the four ratings 
received by each student teacher, the nineteen stu- 
dents receiving the lowest averageratings were se- 
lected to compose the Inferior Student Teacher 
group. Their average ratings ranged from 3.50 to 
6.00 on the ten-point scale, with an over-all aver- 
age for this group of 4.98. 

The nineteen Inferior Student Teachers were 
matched on an individual basis with nineteen of the 
Students receiving the highest ratings. The major 
variables deemed important in setting up equivalent 
groups in this study were age, sex, amount of pre- 
vious teaching experience, grade level in college, 
intelligence, and socio-economic background. The 
Superior Student Teachers had ratings ranging from 
7.50 to 9. 50 with an over-all average for this group 
of 8.25. 

The thirty-eight subjects appearing in the final 
sample were all sophomore girls who were twenty 
and twenty-oneyears ої age. They had alltaken 
both years of college at Northern State Teachers 
College and all were in the process of completing 
the Two Year Elementary Education Program. All 
subjects had come from rural and small town back- 
grounds of South Dakota. None had had previous 
teaching experience. 


The Measurement of Self-Concept 


The Q-technique as firstdescribed by Stephenson 
(7) and modified by Rogers (6) and Raimy (4) was 
the method chosen for the measurement of self-con- 
cept. This technique makes it possible to assess 
the way the individual perceives himself and the way 
he describes an ideal person or the way he would 
like to be. The degree of congruence between these 
two assessments constituted the measure of self- 


acceptance in this study. 

Development of the Instrument— To secure a 
sample of items adequate for assessing the subject's 
self-concept, a population of traits had to bede- 
fined. This was accomplished through an extensive 
canvass of all available standardized tests measur- 
ing some aspect of social relationship, througha 
study of statements made in psychology andperson- 
ality books, through excerpts from recordings of 
counseling interviews, and from published and un- 
published lists of character traits. An initial list 
of approximately 1200 was reducedto 407 when am- 
biguous statements, duplications, and behavior not 
readily observable were eliminated. 

To insure a balance of traits andto check further 
possibilities for additions to the list, behavior traits 
representing related aspectsof behavior were 
placed into categories. After much further reduc- 
tion, revision, and elimination, the grouping re- 
sulted in fifteen major categories oftenitems each, 
Three college professors were asked to make the 
final selection of ninety items to constitute the sam- 
ple traits used in the study. The final steps involv- 
ed randomizing the behavior traits, assigningthem 
a number to simplify later identification, and typing 
them on small cards for ease of manipulation by 
subjects. 

Q-technique requires the subject to sort his 
cards according to a pre-determined frequency dis- 
tribution. This results ina greater number of com- 
parisons being made on the part of the sorter and à 
more penetrating analysis follows than from an un- 
controlled sort. The following frequency distribu- 
tion, platykurtic in shape, was used in this study: 
Score 012 34 5 6 78910 
Frequency 2 4 7 10 14 16 14 10 7 4 2 
Each array had a mean of five and standard devia- 
tion of 2. 26. 

To provide a simple procedure by means of which 
the subject might sort the items into the pre-ar- 
ranged frequency, poster board m ea s u ring twenty 
by twenty-eight inches was ruled into eleven divis- 
ions to represent the various categories in which 
the items might be placed from those on extreme 
left indicating those **least like” to those on the €x- 
treme right indicating those “‘most like" ће self 
(or ideal). The number of items thatcould be 
placed in each division was indicated by the number 
of boxes drawn vertically ontheboard. This type of 
sorting board made it possible for the subject to 866 
his entire sort at once and permitted him to rear- 
range his cards until he felt satisfied with his de- 
Scription. The score for each division was mark 
on the board making it easy forthe subject to record 
his own score. 

Administration of the Sort— Prior to each student 
teacher's reporting to the public schools for his 
first quarter of student teaching, he was asked 19 
Schedule two one-hour periods spaced a week apart 
with the examiner. During the first session 
subject was asked to describe himself as accurately 
as he could by placing the behavior trait cards 0n 
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the sorting board with thoseleastlike himself onthe 
extreme left and working gradually to the right side 
of the board where he would place those most char- 
acteristic of himself. Upon completion of the sort, 
which took on the average about thirty minutes, the 
students were shown how to record their scores. 
. The same materials were used in the second session, 

but this time the subject was askedto sortthe cards 
to describe the person he would like to be like, or 
his ideal self. 

To determine the degree of self-acceptance of 
each subject, the coefficient of correlation between 
the self sort and the ideal self sort was determined 
according to formula: 


ХҮ 
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The resulting coefficients ranged f rom .2501to 
. 1896 tor the Inferior Student Teachers and from 
. 3719 to . 8789 for the Superior Student Teachers. 


The Results 

In accordance with the recommendation of Gar- 
rett (2), the correlation derived from the scores on 
the Q-technique were translated into z scores for 
the remaining computations in this study. This was 
done by referring to Table 14 in the Statistical An- 
alysis in Educational Research by Lindquist (3). 
The mean, standard deviation, standard error of 
the mean, the standard error of th e difference be- 
tween the means of the two groups, the critical ra- 
tio, and the level of probability or significance of 
the findings were determined. A summary of the 
results of these computations follows: 

M S.D. 6m D бекер 


r—-——————————————mnÓ 
Superior 


Student .8553 .221 .0521 .078 2.90 .01 
Teachers 

Inferior 

Student .6289 .248 .0584 

Teachers 


ee es 


Thus the means of the cor relations between the 
two sorts for the two experimental groups were 
found to be significantly different. These findings 
are in accordance with expectation that significant 
and positive relationship exists between a. student's 


self-valuation and his success in student teaching. 
Discussion 


The findings in this experiment with student 
teachers seem to indicate that one factor in suc- 
cessful student teaching might well be the adequacy 
of feelings that the student has about himself as a 
person. Early identification of prospective student 
teachers harboring serious doubts about themselves 
and the provision of adequate counseling services 
might be expected to influence significantly the qual- 
ity of student teaching. The direction which coun- 
seling might take seems to be also indicated by this 
study. An analysis of the individual items has re- 
vealed that Superior Student Teachers and Inferior 
Student Teachers on the average have scored all 
items except two alike in describing the ideal per- 
Son, indicating that the aspirations or value systems 
of the two groups are not too different. However, 
in the process of describing themselves, the two 
groups score nine items with significant differen- 
ces. This would seem to indicate that in most cases 
therapy would be directed toward helping the indi- 
vidual view himself differently. These findings are 
in accord with that which Carl Rogers (5) has re- 
ported in his experiences‘in psychological clinics. 


REFERENCES 


1. Combs, A. E. *New Horizons in the Field'of 
Research: The Self-Concept,” Educational 
Leadership, XV (February, 1958), pp. 315- 

2. Garrett, H. E. Statistics in Psychology and Ed- 
ucation. (New York: Longmans, Green, and 
Company, 1947), p. 303. 

3. Lindquist, E. F. Statistical Analysis in Edu- 
cational Research. (New York: Houghton 
Mifflin Company, 1940), p. 215. 

4. Raimy, V. ''Self-Reference in Counseling In- 


terviews, ’’ Journal of Consulting Psychology, 
XII (April, 1948), pp. 153-165, 315-217. 

5. Rogers, С. R. ‘‘The Organization of Personal- 
ity," The American Psychologist, II (Sep- 
tember, 1947), pp. 358-368. 

6. Rogers, С. В. and Dymond, C. Ps chotherap’ 
and Personality Change (C hicago: University 
of Chicago Press, ). 


7. Stephenson, W. The Study of Behavior (Chicago: 
the University o: cago Press). 


THE JOURNAL OF EDUCATIONAL RESEARCH 
(Volume 55, Number 2, October 1961) 


AF: A New Approach to the Concept of Achievement 


THE INCREASE in thenumber of applicants 
seeking admission to institutions of higher learning 
re-emphasizes the need for reliable and valid pre- 
dictors of academic success (3). There is evidence 
which indicates that the current methods of grade 
prediction by the use of standardized tests are not 
wholly valid indices (1, 4). Evidently, there is more 
involved in academic success in college thanis ap- 
parentinany single factor such as aptitude, achieve- 
ment, interest, emotional stability, and so forth. 

The purpose of this study was to find a measure 
which would tap a source common to thefactors 
which produce academic success and by so doing, 
find a valid indicator. 


Method Тһе group selected for this study was 
Composed of all Freshmen entering Hampton Insti - 
tute during the Fall Semester of the 1956-1957 
School year. There were 182 females and 156 males 
in this group. 

The American Council on Education, P s ychol- 
ogical Examination for College Freshmen, 1949 
Edition (ACE): the California Achievement Tests, 
(CAT): andthe Otis Self Administering Test of Men- 
tal Ability, Higher Form A, (Otis) were administered 
to the entire group. 

The raw scores of all the measures in this battery 
were correlated (Pearson product-moment) with 
each student’s Grade Point Average (GPA) for males 
and females. 

Achievement Factors (AF) were calculated for 
each student and related to GPA. AF is defined as 
the relative difference between a student’s aptitude 
and achievement in standard scores in a given area. 
There were two areas calculated: quantitative (Q) 
and verbal (V). It was necessary to transform all 
raw scores іп the ACE, ACE;,, CATmath, and 
CATreading, into standard scores. The AFg was 

determined by subtracting the standard score in the 
ACE, from the standard score in the CAT math and 
adding 100 for each student. A similar method was 
used in calculating AFv, using the ACEL and 
CATreadingdata. Thus, a high AF indicates a higher 
relative performance in the achievement test than 
the aptitude test in a given area. 


ROBERT M. ROTH and JEAN GILBERT 
Hampton Institute 
Hampton, Virginia 


Results Тһе relationship of the various test scores 
and GPA is evidenced by the correlatious in Table I. 

The highest correlation for the GPA of males 
with any other single measure was 0.25 withthe 
ACEy,. This tends to suggest that for the most part 
the test scores had no significant implications for 
the prediction of academic success as measured by 
GPA. The highest correlations for the GPA of fe- 
males withany other measure was .24 between GPA 
and ACEL. 

Since examination of anyone of the measures 
alone revealed inadequate relationships with GPA, 
these relationships were then investigated with re- 
Spectto AF. The various relationships are expressed 
in Table II. A high correlation was noted in the AFq 
and GPA relationshipsfor males. A significant cor- 
relation between AF; and GPA was found for fe- 
males. However, this correlation did not prove to 
be sufficientlylarger thanthe correlations between 
АСЕт ог АСЕү and GPA. Nevertheless, large dif- 
ferences in correlation coefficients between the fe- 
males’ AFg and AF, was evidenced. 


Discussion The lack of validity in the prediction of 
grades by standardized tests is evidenced in the 
literature and found support inthis study. A different 
approach, the AF, was found to have validity in the 
prediction of grades. A sex bias in the results in- 
dicated a relationship between achievement and sex 
roles. These results point to the nature of patterns 
of achieving rather than single factors of ability, 
intelligence, and quantity of achievements as having 
a definite relationship to the attainment of grades. 

The implications of these findings are numerous. 
Basically, these findings imply thatacademic 
achievement is a pattern of behavior not wholly re- 
lated to one’s capability or skill. Similarly, Roth 
(2) found that success in a college reading improve- 
ment program was not related to ability, reading 
level, or achievement level, but rather the place 
that success in reading improvement had in the 
structure of the individual’s self perceptions. The 
sex bias evidenced in this study seems to suppor 
the idea that academic achievement is relatedt? 
non-intellectual factors. 


A—— n À ж. 
_ - 
TABLE I 
CORRELATIONS BETWEEN TEST SCORES AND GPA 
FOR MALES AND FEMALES 

Sex ACEg АСЕТ, ACEtot OTIS CATread CATmath | CATjang 

Male 11 .25 Eun .23 .19 121 312 

Female 213 224 221 11 LT .12 .14 


TABLE II 
CORRELATION COEFFICIENTS BETWEEN AF AND GPA 


AF 
Sex EAT ees аа. Ар 
Male 60 34 
Female -.28 .25 
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Much more must be understood about achievement 
to produce valid predictors of academic success. 
It is felt that AF is an inroad to the understanding 
of the phenomenon of academic achievement. Un- 
doubtedly, much more research is required in this 
area. 


Summary This study was an attempt to test the 
hypothesis that the Achievement Factor (AF) could 
be used as a valid indicator of academic success 
at the Hampton Institute. Academic success was 
considered in terms of grade point average. 

Test scores for the 1956-1957 Freshmen class 
were correlated with grades at the end of the fall 
semester. These results indicated that the levelof 
relationship between scores and grades earned was 
inadequate. However, the significance was achieved 
when AF was related to GPA. There was asex bias 
in these results. 

It was indicated in this study that academic a- 


chievement is related to patterns of achieving, al- 
though much more research is necessary in this 
area. The AF conceptis a suitable approach to this 


end. 
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Field News 


Readers are invited to send items of research inter- 
est to John Schmid, Professor, College of Education, 
University of Arkansas, Fayetteville, Arkansas. 


ANNOUNCEMENT OF A PILOT INFORMATION SERVICE OF 

EDUCATIONAL RESEARCH MATERIAL 
Are you reading more now and remembering less? 
If you are active or otherwise interested in the field of educational research, 
you are probably finding it more and more difficult to assure yourself that you 
are keeping up-to-date on current developments. If you are at all typical, you 
are probably trying to scan an inordinate number of source periodicals and re- 
ports in order to locate those articles and discussions that really interest you. 
Again, if you are at all typical, you are getting somewhat frantic or disgusted 
or justbored when confronted with the requirement of perusing so much mater- 
ial that is of little interest inorder to isolate those items that are desired. Апа 
if you are not one of a vanishing minority, youhave less time available for per- 
using and you are doing less of it than you know you should. 

If, by awave of a magic wand, you could be assured that by proper speci- 
fication of your interests through a question, a mechanized searching activity 
could notify you on a current basis of everything that is being published any- 
where in the world—would you wish to take advantage of such a facility? 

This facility has notyet beencreated, but the magic wand has been raised 
by the U. S. Office of Education in supporting the development of a pilot informa- 
tion service of educational research materials. The pilot study is being con- 
ducted at the Center for Documentation and Communication Research of Western 
Reserve University. The primary purpose of this program is to develop and 
put into operation on a pilot basis a suitable information retrieval system which 
will permit the detailed analysis of educational research material and selective 
dissemination, based on individual interests and requests on a current basis. 

The magic wand of a mechanized information r etr ieval service can be 
placed in your—the reader's—hand by cooperating withus today in making known 
how you might specify your interests, through a question or several questions, 
when such a service is offered to you. 

Let us give you some examples of alternate ways inwhich you might phrase 
a question and thus provide you with broader or narrower selections of the lit- 
erature: 

1. Send me everything published on “сопсері formation". 

2. Send only those materials which discuss **concept formation" in 
young mentally retarded children. 

3. Confine my question to ‘‘concept formation" in young, mentally 
retarded children as an indicator of educability. 


The examples given here become successively more selective, with the 


third one being quite narrow. 
Please send us questions in your fields that represent the subject matter 


as well as the generality of selectivity that might satisfy you in a service that 
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would keep you currently informed of everything going on in areas of educa- 


tional research of interest to you. > 
Your questions will be useful to the Center and to the U. S. Office of Ed- 


ucation in pre-testing the pilot system on the basis of user requirements. А 
significant sampling of questions will be searched and the results made avail- 
able to the sender for evaluation. 
Please send your questions to: 
Professor Allen Kent, Associate Director 
Center for Documentation and Communi- 
cation Research 
School of Library Science 
Western Reserve University 
Cleveland 6, Ohio 
Also, please urge your colleagues to send questions, too. 


HANDWRITING RESEARCH 

The Handwriting Foundationreports the resultsof a questionnaire sent to mem- 
bers of the American Educational Research Association and members of the 
Division of Educational Psychology of the American Psychological Association. 
The purpose of the inquiry was to determine the extent to which interest exist- 
ed inthis country in research in handwriting, particularly as regards factors 
related tolearningthe skill and factors affecting legibility. Ап outcome of this 
Study might result in the definition of a sub-sample of handwriting specimens 
to be photographically copied and placed in campus libraries strategically lo- 
cated throughout the country. These specimens would be available for longi- 
tudinal research studies. Further information about the work of this organiza- 
tion may be obtained from the Handwriting Foundation, 1426 С St., N.W., 
Washington 5, D. C. 


POST-DOCTORAL FELLOWSHIPS IN COLLEGE ADMINISTRATION 
The Center for the Study of Higher Education at the University of Michigan an- 


mum of $8000. Applications should be completed andon file not later than 
February 1, 1962. 


STRESS DURING EXAMS AND TEST SCORES 


Recently, John W. French of the Educational Testing Service undertook a study 
to determine the relationship between emotional States aroused at the time of 


in 16 public high Schools took the College Board SAT in January 1960. It was 
found that anxiety plays no role in the average results achieved by boys, but 


anxiety causes a slight boost in mathematical scores and a slight dip in verbal 
Scores for girls. 


THE COOPERATIVE RESEARCH PROGRAM 


In the May 1961, Newsletter of the Cooperative Resear 

ch Program of the U. S. 
Department of Health, Education and Welfare it is reported that 99 proposals 
Were received between S eptember 1 and December 1, 1960. Twenty-one of 
these proposals were recommended for approval. 


TRAINEESHIPS IN BIOMETRY 


The Department of Health, Education and Welfare announces the availability of 
a number of graduate traineeships in Biometry. Many competent, well-prepared 
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undergraduates, as well as more advanced students, are almost universally 
unaware of the possibilities of support available to them for graduate study in 
Biometry. Мапу doctoral students, both Ph. D.'s and M.D.'s are vitally need- 
ed to undertake the extensive training requiredfor a successful career inthese 
areas. To meet these changing needs for scientific manpower, a new and high- 
ly promising program of Biometry training grants, offering support for grad- 
uate students and post-doctoral fellows, has evolvedto complement the ongoing 
research fellowship program. Further information about these grants may be 
obtained by writing to 
Emmarie C. Hemphill, Exec. Sec. 
Advisory Committee on E pidemi- 
ology and Biometry 
Division of General MedicalSciences 
Department of Health, Education and 
Welfare 
Bethesda 14, Maryland 


RESEARCH IN MARYLAND 
Research projects under way this year in Baltimore County, Maryland, Schools 


include the following controlled experiments: 
1. The Teaching of Fourth Grade Arithmetic Using a Programmed Text 


2. The Effectiveness of Group Counseling on Academic Achievement of 
Underachieving Tenth Grade Students 


3. An Investigation of Remedial Reading versus Psychotherapy as Treat- 
ment for Severely Retarded Readers 


4. An Investigation of the Effect of Large Class Size on Achievement of 
Students in Eighth Grade Social Studies 


5. The Effectiveness of the Individualized Reading Approach in Teaching 
Reading in Elementary School 
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. It Says in the JER... 


THE ATTITUDES OF EVEN A SOPHISTICATED GROUP can be changed by a well- 
conceived educational program, Rosinski reports as the result of research 
in connection with an intensive medical teaching institute at the University of 
Buffalo. “Хо one is too old or too rigid to change his mind, ’’ the author 
concludes. 


OUR SOCIAL SCIENCE TEXTBOOKS AREN'T VERY GOOD, Dimitroff says, She 
looked for 15 recommended generalizations in 30 intermediate-grade geog- 
raphy, history, and social-studies texts, found them to be ‘‘unsatisfactory 
in quantity and quality. ’’ | 


ORAL READING PERFORMANCE is a general skill that isn't necessarily defined 
by oralreading errors, McCracken thinks. The number of errors a child 
makes is important but the kind of errors is not. Comprehension is a sepa- 
rate skill. Fluent oral reading does not insure high comprehension, nor does 
poor oral reading mean poor comprehension. 


A COURSE IN CHILD DEVELOPMENT can measurably influence a stude nt's atti- 
tudes toward children, Leton reports, but it does not seem to make any dif- 
ference what instructional method is used—lecture, case study, or group 
discussion. Attitude change is positively correlated with achievement scores, 
he says, on the basis of research at UCLA. 


HOW DOES A JUVENILE DELINQUENT DIFFER FROM A NON-DELINQUENT? Іп 
a study of parolees from the Lyman School for Boys at Westboro, Mass., Litwack 
found that parole violators (recidivists) 1) were shorter than the boys who 
made a success of parole (non-recidivists), 2) had younger mothers, 3)were 
younger themselves, 4) had less education, 5) tended to have fathers with 
court records and jail sentences, 6) had spent time in the discipline cottage; 
and 7) re-entered a public school instead of going to work. 


THERE IS A SUBSTANTIAL RELATIONSHIP between vocational interests and oc- 
cupational proficiency, Stone hypothesizes on the basis of a preliminary 
study involving stenographers. 


A CHILD'S I. Q., HINGES ON HIS FAMILY BACKGROUND, specifically on his 
father's occupation, social mobility, and education, the Schmucks report after 


studying the relationships between sociological variables andtest performance 
among fourth-grade pupils in a Detroit suburb. Children from distinctly 
upwardly-mobile families tended to have highI. Q.'s; negligibly upward- mobile 
families tended to have low-I.Q., children. Fathers in ‘‘head’’ jobs tended 
to have high-I. Q., children; offspring with low I. Q.^s tended to come from 
fathers with ‘‘hands’’ occupations. Furthermore, fathers with more than 12 
years of formal education tended to have high-I. Q., children, whilefathers 
with 12 years or less of formal education tended to have low-I. Q., children. 
Incidentally, pupils whose fathers had “һапбв” occupations and whose 
mothers worked outside the home tended to score in the low-I.Q., group 96 
percent of the time. ‘‘None of these results should be astounding for anyone 
who thinks back on the crude procedures Binet used for validating his first 
intelligence tests," the Schmucks point out, and they add: ‘‘It may be well 

for educational psychologists to look more closely at just what they are meas- 
uring when they administer and score an I. Q., test." 


WHETHER OR NOT A WORD IS DIFFICULT OR EASY TO READ OR SPELL does 
not dependon either its connotative emotional tone or its emotional intensity, 
at least not in school situations, Bloomer reports. 


WHO INFLUENCES STUDENTS TO TAKE COURSES IN ADVANCED MATH? A high 
percentage are influenced by their parents, a lesser percent by various School 
personnel, and over one-fourth by their friends, so reports Parkinson from 
a Wisconsin study. The situation points up the need for agreement among 
all concerned on the purposes and value of such courses, he says, but agree- 


ment does not now exist. 
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School Influence in Student Choices of 


-High-school Mathematics Courses 


RECENT NATIONAL concern r eg arding enroll- 
ments in mathematics has pointed up the need for 
information concerning the ways in which students 
select courses in this area and the means used by 
schools to influence these selections. This paper is 
intended to report certain research, conducted in 
Wisconsin, regarding this topic. 

In the spring of 1957, graduating seniors in Wis- 
consin private and public high school were asked, 
in connection with another study, to indicate the 
courses which they had taken in high school. Re- 
plies were received from 34,151 seniors, repre- 
senting approximately 89 percent of those in private 
high schools and 96 percent of those in public ones. 
From this group a sample of 5,675 students was 
taken, representing one senior out of every six in 
each high school. Of this sample, 1,985 students 
stated that they had followed a c ollege-preparatory 
course of study or track. Higher percentages of 
this college-preparatory group had taken courses in 
mathematics, science, and foreign language than 
had non-college-preparatory students. When these 
Students were broken down into quartiles on the ba- 
Sis of class rank, it was found that 91.6 and 75.3 
percent of the college-preparatory students in the 
lowest quartile had taken algebra and geometry in 
comparison with 85.4 and 54.2 percent of the non- 
college-preparatory students in the highest quartile. 
Similar findings appeared when the breakdown was 
based on scores received on the Henman-Nelson 
test*. This seemed to indicate that college-prepar- 
atory status was an important factor in deciding 
Whether or not students would take courses in alge- 
bra and geometry, or general mathematics. 

The question of whether students should be en- 
Couraged to take particular courses in mathematics 
9n the basis of intelligence, occupational objectives, 
College-preparatory status, or some other factor 
has yet to be decided. However, it is proper to ask 
What role the school has inthese selections. This 
Paper is therefore directed toward finding out who 
in Wisconsin high schools is responsible for guiding 
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Students into courses in mathematics, which of a 
Series of items are regarded as most important in 
School policy for this guidance, and what individuals 
a selected group of students consider most impor- 
tant in helping them to decide on courses in mathe- 
matics which they are taking. As it was desired to 
study situations in which students had a choice be- 
tween courses in mathematics, the study was limit- 
ed to schools offering both albebra and general 
mathematics and whose graduation requirements did 
not specify either of these courses. 

On this basis, a group of 219 schools was selec- 
ted and questionnaires were sent tothe principal and 
the head of the department of mathematics in the 
fall of 1958. Of these, 198 responded and 32 were 
excluded leaving 166 schools, or 83. 8 percent of the 
198 returns, to be includedin the study. Because 
of the criterion regarding graduation requirements, 
it was impossible to tell ifa school would have been 
included in the study until returns from it were ex- 
amined, and therefore it is only possible to say that 
the 166 schools represent a percentage of a poten- 
tial population in Wisconsin whichis somewhere be- 
tween 89.2 and 100.0. These 166 schools are bro- 
ken down on the basis of whether or not they have a 
college-preparatory track, 95 having such a track 
and 71 not having it. 

Five individuals are given responsibility for the 
guidance of students in regard to mathematics by 
over ten percent of the schools: the principal, the 
guidance director, the mathematics teacher, the 
home room teacher, and the school counselor 
(Table I). Among schools with а college-prepara- 
tory track, 61.1 percent listthe principal as impor- 
tant, 41.1 percent the guidance director, 34.7 per- 
cent the mathematics teacher, 26.3 percent the 
home room teacher, and 16.8 percent the school 
counselor. Among schools without sucha track, 
64.8 percent list the principal, 38.0 percent the 
mathematics teacher, 22.5 percentthe guidance di- 
rector and home room teacher, and 12.7 percent 
the school counselor. Тһе principalis thus listed 
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passim. 
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TABLE I 


PERCENTS OF SCHOOLS WITH AND WITHOUT COLLEGE-PREPARATORY TRACKS 
THAT PLACE RESPONSIBILITY FOR STUDENT GUIDANCE REGARDING COURSES 
IN NINTH GRADE MATHEMATICS UPON PARTICULAR INDIVIDUALS 


ш=—=————Є;ү———Є———Є—Є—Є—Є—Є&Є—Є—Є—————Є—Є—Є—Є—Єү—ү—Є—Є—Є—Є—Є—Є 


With College- Without College- 
Preparatory Preparatory 
Individual Tracks Tracks 
N- 95 N-71 
Principal 61.1 64.8 
Guidance Director 41.1 22.5 
Mathematics Teacher 34.7 38.0 
Home Room Teacher 26.3 22.5 
School Counselors 16.8 12.7 
Vice-Principal 9.5 7.0 
Eighth Grade Teacher 5.3 1.4 
Class Advisor 2.1 1.4 
Any Teacher 2.1 2.8 
Superintendent of Schools dodi EU 
Elementary School Principal ---- 1.4 
No Reply ---- 1.4 
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AVERAGE RATING* BY SCHOOLS WITH AND WITHOUT COLLEGE-PREPARATORY TRACKS 
OF A SERIES OF ITEMS TO SHOW THEIR IMPORTANCE IN SCHOOL POLICY FOR THE 
SELECTION OF MATHEMATICS COURSES THAT STUDENTS TAKE IN NINTH GRADE 


iS 


Item 


.. _________ мшш ava с эзы —-ш—1 


Score on a standard test of mathematical ability 
Grades in previous work in mathematics 

Reports from the eighth grade teacher 

I. Q. score 

Occupational objectives of the student 

Presence or lack of intention to go to college 
Student motivation in general 

Parental opinion of the courses the student should take 
Track in which the student is enrolled 

Score on test other than one of mathematical ability 
Student personality characteristics and conduct 
Health 


Attendance 


With College- Without College- All Schools 

Preparatory Preparatory 
Tracks Tracks 
N - 95 N-71 N - 166 
3.6 3.7 3.6 
3.6 3.3 3.5 
3.5 3.2 3.4 
3.2 3.3 3.2 
3.0 2.9 2.9 
2.8 2.9 2.8 
2.7 2.8 2.7 
2.6 2.5 2.5 
2.6 1.4 21 
2.1 1.9 2.0 
ЕЛ, 1.9 1.8 
1.7 1.8 1.7 
1.8 1.6 Lat 


a eee ee eee 


* Items were rated from one to five, Wi 
was extremely important. 


th one indicating an item of no importance and five an item which 
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TABLE III 


PERCENT OF STUDENTS TAKING ELEMENTARY ALGEBRA OR GENERAL MATHEMATICS 
IN ONE CLASS EACH IN SEVEN HIGH SCHOOLS WHO STATED THAT CERTAIN 
INDIVIDUALS НАГ INFLUENCED THEIR DECISION TO TAKE THIS COURSE 


m -— —— —————————————- AÉlÓ——————————ÀMÁ O 


Individuals Elementary Algebra General Mathematics Total 
N - 169 N - 155 N - 324 

Parents 60.9 45.2 53.4 
Mathematics Teacher 43.2 18.1 31.2 
Friends 21.2 30.3 28.7 
High School Principal 13.6 32.9 22.8 
School Counselor 17.8 12.3 15.1 
Home Room Teacher 13.6 8.4 11.1 
Brother or Sister 10.1 5.2 7.7 
Eighth Grade Teacher 4.7 2.6 3.7 
Relatives 2.4 1.9 2.2 
Boy or Girl Friend 1.2 --- 0.6 
Grade School Principal 0.6 --- 0.3 
A College Student 0.6 --- 0.3 
A Doctor --- 0.6 0.3 
No One 4.7 9:7 Tt 
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as important by a larger group of schools withouta 
college-preparatory track than Schools with such a 
track, while this situation is reversedfor the guid- 
ance director, home room teacher, and school 
counselor. This seems to indicate that the princi- 
pal generally takes greater direct influence on the 

‚ guidance of students in schools without a college- 
preparatory track than in schools with it, and that 
in schools with such a track, more direct influence 
is exerted by persons in a formal guidance organi- 
zation. 

A series of thirteen items was submitted to each 
principal and he was asked to rate each from one to 
five regarding their importance for the guidance of 
students into courses in ninth-grade mathematics. 
A rating of one means an item of no importance and 
a rating of five an item which is extremely impor- 
tant. Four of the thirteen items received an average 
rating of 3. 0 or higher by all of the schools (Table 
П). Тһе highest rated item is''score on а standard 
test of mathematical ability" which received an 
overall rating of 3.6. This is followed by “‘grades 
in previous work in mathematics?’ whichis rated 3. 6 
by schools with a college-preparatory track, 3.3 
by schools without this track, and 3.5 by all of the 
Schools. The third item is ‘‘reports from the eighth 
grade teacher’’ which received an average rating of 
3.5 from schools with a college-preparatory track, 
3.2 by schools without thistrack, giving an overall 
average of 3.4. The fourth item, “I. Q. Score," 
received an average rating of 3.2 by all ofthe 
Schools. Another item, **occupational objectives of 
the student, ’’ followed these with an average rating 
of 2.9. The remaining items, including college іп- 
tentions, motivation, and parental opinions, were 
rated considerably lower than those given above. It 
seems fairly clear from an examination of the high - 
est rated items that these reduce themselves to dif- 
ferent measures of the ability of students to succeed 
in courses in mathematics, and that other items 
tend to have relatively less importance. 

_Previous research, as stated in the beginning of 
this paper, showed that senior students, when broken 
down on the basis of whether or not they were pre- 
Paring for college, also showed a division in the 
Proportions taking algebra and geometry. If schools 
attempt to encourage students to take these courses 
Оп the basis of some measure of their ability to do 
the work, then it appears that something else, other 
than encouragement of school personnel, must enter 
into this system and influence student choices of 
Courses in mathematics. In order to gain an indi- 
Cation of what this might be, one class each in alge- 
bra and general methematics in seven high Schools 
е asked to indicate who had influenced them to 
le à that particular course. The schools were se- 
AP ей to cover a wide range of enrollments, and the 

asses were selected on the basis of availability. 
Therefore, the student re sponses, while they are 
ue to be indicative of the responses which 

ight be made, are still not an adequate, random 


sample of all of the students taking algebra and gen- 
eral mathematics in the schools included in the 
study. 

Six cetegories of individuals were checked by o- 
ver ten percent of the 324 students in elementary 
algebra and general mathematics as influencing 
them to take this course (Table Ш). Parents were 
checked by over sixty percent of the algebra stu- 
dents, by over forty-five percentofthe general 
mathematics students, and by 53.4 percent of the 
entire group. А mathematics teacher was checked 
by about forty-three percent of thealgebra students, 
by about eighteen percent of the general mathematics 
students, and by 31.2 percent of the entire group. 
Friends were checked by about thirty percent of all 
ofthe students. The high school principal was 
checked by about thirteen percent of the algebra 
students, almost one-third of the general mathe- 
matics students, and by slightly over one-fifth of the 
entire group. School counselors were checked by 
close to fifteen percent of all the students; and the 
home room teacher was checked by close to thirteen 
percent of the algebra students, about eight percent 
of the general mathematics students, and by about 
eleven percent of the entire group. It thus appears 
that a high percentage of these students were influ- 
enced by their parents, and that lesser perc ents 
were influenced by various school personnel. Over 
one-fourth were influenced by their friends. It also 
appears that many of these students were influenced 
by more than one individual. For this reason, itis 
unfortunately impossible to provide an index of the 
total influence of the school by adding together the 
percents of students who were influenced by school 
personnel. 

The data received from students, fragmentary 
though it may be, does give an indication of a pos- 
sible solution to the problem posed bythe disagree- 
ment found between the efforts of schools and the 
actual student enrollments in mathematics. If the 
influence of parents and other individuals outside of 
the school is important in the decisions whi ch stu- 
dents make concerning their courses in mathemat- 
ics, then it follows that the efforts of schools in this 
matter would not be completely effective. At the 
same time, the fact that a number of influences ap- 
pear to be operating in student choices of courses 
in high school mathematics points up the need for 
agreement among all concerned regarding the pur- 
poses and value of these courses. Unfortunately, 
as school personnel seem to regard intelligence as 
the most important factor in course selections while 
other persons influencing students seem to place 
some emphasis upon college-preparatory status, 
this agreement apparently does not exist. 
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Connotative Meaning and the Reading and 


Spelling Difficulty of Words 


THE PRESENT STUDY deals with the relation of 
two connotative aspects of the significates of words 
as they affect the reading and spelling difficulty of 
those words. These connotative aspects are the e- 
motional tone and the emotional intensity of the 
words. 

Regarding the relation of em ot ional tone to the 
learning difficulty of words, considerable conflicting 
evidence exists. By and large, where free selection 
of the words is a function of the learner or user, 
positive words seem to be more often used and more 
often correctly used than negative words. Hunt (T) 
and Davis (1) both indicated a prepotence of positive 
words which are used earlier and more often by 
children. Ulrich (12) and Travis (11) separately 
found that misspellings by neurotic children tended 
to be a function of negative words. Studies of mem- 
огу and emotional tone reviewed by Meltzer (8) in- 
dicated a prepotence of positive memories. 

Conversely, in situations where children were 
required to learn words asa spelling problem, 
Frandsen (2) found little relationship between the 
Spelling difficulty of the word and whether the child 
liked or disliked it. Similarly, Smith (10) found no 
difference between hostile and non-hostile subjects. 
These studies suggest that connotative aspects may 
not be as important when the child is required to 
learn the word as they are when the child is given a 
Íree expression situation. 

_ Emotional intensity as it relates to learning dif- 
ficulty is similarly a complex. On the surface it 
would appear that those stimuli having the greatest 
impact would tend to be retained more often and 
therefore Should be easier tolearn. However, as 
Zipf (13) points out, as the frequency of occurrence 
of a word increases, the intensity tends to diminish. 
Atthe same time, as frequency increases, word 
length diminishes. Both frequency and word length 
are related to the learning difficulty of spelling and 
reading words. Goodstein (4) worked with the single 
dimension of degree of positiveness which may be 
interpreted as relating to the positive intensity of 
the stimulus. He determined recognition threshold 
ur а range of positive stimuli and found the corre- 

ation to be .53. However, when frequency of oc- 
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currence was partialed out, that correlation coeffi- 
cient was reduced to .18, which would support 
Zipf's notion that intensity is relatedto frequency of 
occurrence. 

Hull (6) examined the problem of intensity in 
terms of the sequence of learning. He speculated 
that if learning proceeds from the most intense 
stimulus to stimuli of lesser intensity, the prob- 
ability of error or confusion between the stimuli 
would be considerably lower than whenlearning pro- 
gresses in the opposite direction. If, then, less in- 
tense stimuli are presented to the reading learner 
by virtue of the fact that they are more frequent, a 
lower slope to the generalization gradient anda 
higher probability slope of confusion would be ex- 
pected. 

The literature suggests first that connotative 
positiveness and negativeness of words may affect 
word selection and difficulty when the respondant is 
free to select his words, butnot when the words are 
selected for him. Second, the connotative intensity 
of the meaning of a word tends to act in opposition 
to the frequency of occurrence of the word. Thus, 
any effect of connotative intensity within the words 
presented to children in school will be negligible. 

Hence, in the present school situation it would 
appear that the connotative aspect of words would 
bear little relation to the learning difficulty of the 
words. The present study exam ined the relation- 
ship between the emotional intensity an d emotional 
tone of words and the spelling and reading difficulty 
of words. 


Procedure 


The population for the study was 149 words se- 
lected at random from The New Iowa Spelling Scale 
by Greene (5). Each word in the sample also oc- 
curred in Gates (3). 

The criteria were as follows: 

1. The grade level at which fifty percent of the 
children spelled the word correctly was derived 
from The New Iowa Spelling Scale by linear interpo- 


lation. 
2. Reading difficulty was taken from Gates (3) 
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and it represented the grade level at which fifty per- 
cent of the children tested could read the word and 
Select the appropriate meaning from a multiple 
choicelist. Admittedly this was a relatively stiff 
criterion for reading difficulty since the child must 
be able to read not only the word itself but also the 
words of the definition. Gates states, however, that 
the words were defined by words occurring more 
frequently than the word tested. Presumably, then, 
the difficulty of the word was the determining factor. 
Words were used in the present study only if Gates 
presented a grade level at which fifty percent of the 
children read the word. 

There were several variables. 

1. The emotional tone of a word was a measure 
of the positiveness or negativeness of the meaning 
of the word. Forty-five college junior students 
were given а list of 550 wordsarrangedin a random 
order. The list containedthe 149 words ofthe sam- 
ple interspersed at random among the total list. 
The students were asked to mark whether they felt 
the word to be positive or negative. The sum of the 
judgments of the group for each word represents 
relative positiveness or negativeness. 

Reliability of the judgments: Fifty ofthe words in 
the sample were repeated a second time. Since 
there is evidence that the context of the judgment 
will reflect in the judgment itself, it was necessary 
to determine whether or not this context was impor- 
tant enough to affect the judgments in the present 
study. Therefore a correlation between each of the 
two judgments for these words was employed to de- 
termine the effect of this context. The correlation 
coefficient was r = .78, indicating a high degree of 
reliability between the judgments of the words in 
differing contexts. 

2. The emotional intensity of a word was a mea- 
sure of the stimulus strength of the meaning. For- 
ty-seven students were given the list of words and 
were asked to judge whether they felt the words to 
be strong or weak. The sum ofthe judgments of the 
group represented the relative intensity of the word. 

Reliability of the judgments: The procedure for 
determining reliability was the same as above. The 
correlations between the two judgments ofthe words 
yielded a coefficient of г = .76, indicating that the 
judgments of the words in different contexts had a 
high degree of reliability. 

3. The frequency of occur rence of a word was 
taken from A Basic Vocabulary ofElementary 

School Children by Henry Rinsland (9). In each case 
the frequency of occurrence employed was the total 
occurrence of the word in all eight grades. 

4. Word length was determined from the number 
of letters in the word. 


Results 


Emotional tone and reading and spelling difficul- 
y: Table I indicates that there was no relationship 
etween the spelling difficulty and the emotional tone 
f words. The positiveness or negativeness of a 


word apparently, then, had little relation to whether 
or not a word was difficult to spell. Low, negative, 
significant relationships were obtained between e- 
motional tone and the gradelevel at which fifty per- 
cent of the children could read the word correctly, 
This indicated a very slight tendency for children to 
learn to read words positive in meaning somewhat 
earlier in their school careers. This relationship 
may be seen as a function of the fact that positive 
words tend to be more frequent; hence chil dren, 
taught frequent words earlier in school, will tend to 
learn more of the positive ones. The partial cor- 
relation between reading difficulty and emotional 
tone with frequency held constant was -.158 and not 
significant. 

Emotional intensity and reading and spelling dif- 
ficulty: Table I indicated a positive relationship be- 
tween the grade level at which fifty percent of the 
words were spelled and intensity. There was а ten- 
dency for intense words to be very slightly more 
difficult to learn to spell than words which are not 
intense. Much the same result was indicated for 
the reading difficulty of words. Again, in interpret- 
ing these data it is important to remember that the 
limited range of frequency of the words in the study 
would tend to reduce the correlations. Following 
Zipf's notion, we would expect that more intense 
words would tend to be less frequent as well as long- 
er. These substantiations are borne out in Table I. 
Within the limited restricted range of the sample, 
these two variables appeared to be more important 
than the intensity variable and tended at the same 
time to give an inverse relationship to the one which 
might be theoretically expected, i.e., intensity be- 
ing positively related to word length, and negatively 
related to reading and spelling difficulty. Тһе fol- 
lowing partial correlations were obtained: intensity 
and spelling difficulty with frequency constant, r= 
+ 152; intensity and reading difficulty with frequency 
constant, г = .166. These relationships werenot 
significant and indicated thatthe small relationship 
between intensity and spelling and reading difficulty 
was at least in part a function of frequency of occur- 
rence. 

When word length has held constant, the relation- 
Ship between spelling and intensity was r = . 128, 
and between reading and intensity, r - .192. These 
relationships, similarly, were not significant and 
indicated that word length was also a contributor to 
the slight relationship between intensity and reading 
and spelling difficulty. When both word length and 
frequency were held constant, the relationship of in- 
tensity to spelling difficulty was г = .074, and be- 
tween intensity and reading difficulty, r= .149. 
This indicated that intensity, in and of itself, has 10 
relationship to reading or spelling difficulty of 
words within the limited range of frequency of 0C- 
currence from which this sample was taken. Whe- 
ther this finding would be true for a wider range of 
frequency of occurrence remains subject to further 
investigation. 

Emotional tone and emotional intensity: Table 1 
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indicated the relationship between emotional tone 
and emotional intensity to be r = -.003, indicating 
that no relationship existed. However, emotional 
intensity may be seen as degrees of positiveness or 
negativeness. Hence, highly positive or highly neg- 
ative words would tend to be more intense than those 
which are only moderately positive or negative. 

Figure 1 describes the relationship between the 
positiveness or negativeness of the meaning of the 
words and the intensity of the meaning. The curve 
Shown is bimodal. If the words are divided into 
positive words and negative words at class interval 
nine, intensity relates negatively to the emotional 
tone when negative words are selected, and posi- 
tively when positive words are selected. The rela- 
tionship between negative words and their intensity 
was r= -.488, indicating that words most often 
judged negative tended to be judged as more intense. 
Conversely, when positive words were related to 
the intensity of the words the correlation was . 379, 
indicating that words more often judged as positive 
words tended to be more intense. 

The degree of positiveness or negativeness, how- 
ever, did not relate to the spelling difficulty or 
reading difficulty of the word. For spelling diffi- 
culty the relationship for negative words was г = 
.054. The relationship between spelling difficulty 
and the degree of positiveness of words was r- . 034. 
Similar correlations for reading difficulty indicated 
r = -, 048 for degree of negativeness and reading 
difficulty, and г = .034 for degree of positiveness. 
These data indicate that while emotional intensity 
and degree of positiveness or negativeness are re- 
lated to one another, this relationship does not af- 
fect the spelling or read ing difficulty of the words 
within the limited range of frequency with which the 
study was concerned. 

Frequency of occurrence and spelling and reading 
difficulty: Table I indicates the relationship of fre- 
quency of occurrence to spelling and reading diffi- 
culty. When the frequency of occurrence as taken 
directly from Rinsland (9) is related to spelling and 
reading difficulty, the correlation coefficients indi- 
cated that as a word becomes more frequent, the 
grade level at which children tend to learn to read 
and spell the word decreases. This can be seen in 
part at least as a function of the fact that words for 
reading materials, and to some extent for spelling 
materials, are graded in relation to frequency of 
occurrence. 

Word length and spelling and reading diffic ul ty: 
Table I indicated a strong relationship between the 
spelling difficulty of a word and its length. On the 
ther hand, the relation of word length to reading, 
vhile significant, was relatively low. This seemed 
o point to a difference in the two processes. In 
pelling, the unit for response is the single letter; 
ence, spelling difficulty is a function of the number 
f responses made. In reading, on the other hand, 
he unit for response is the whole word. 

The relationship between reading difficulty and 
ord length may be seen in part to be a function of 


the fact that longer words tend to be less frequent, 
The partial correlation between reading diffi culty 
and word length with frequency constant was r= 
.110, and not significant. The relationship between 
word length and spelling difficulty with frequency 
constant was relatively strong and significant, r= 
. 469. 

The relationship between intensity andword 
length plays a relatively small part in the relation- 
Ship of word length to spelling difficulty and reading 
difficulty. Partial correlation between word length 
and spelling difficulty with intensity held constant 
was r = . 573 and for reading difficulty, r = . 237. 

Zipf (13,,14) postulated that as words are more 
frequently used, they tend tobe shortened. He gave 
examples such as the word ‘‘aeroplane’’, which was 
shortened first to **airplane", and then to *'plane." 
Following this reasoning, there should be a rela- 
tionship between the frequency of occurrence of 
words and their length. Table I indicated a very 
mild relationship. For the present sample, there 
was a tendency for shorter words to be more fre- 
quent. Zipf (13, 14) also suggested that intensity of 
a word was dependent upon its frequency; the more 
often a word is used, theless its impact since its 
users become accustomed to 4. TableIshowed 
correlations of -.198. This relationship was sig- 
nificant at the .05 level, and indicated that, toa 
mild degree, more frequent words tended to be less 
intense. However, intensity appeared to bemore 
closely related to word length than to frequency (r 
= .250). The partial correlations indicated that the 
relationship between intensity and frequency is 
largely a function of the factthat more intese 
words tend to be longer rather than shorter. When 
word length was held constant, the relationship be- 
tween intensity and frequency (г = .109) was not sig- 
nificant. Conversely, whenfrequency was held con- 
stant, relationships between intensity and word 
length were significant (r - .202). 


Discussion 


Some differences in relation tothe variables ap- 
pear between reading and spelling difficulty. Emo- 
tional tone, though barely significant, is more im- 
portant іп reading than in spelling. This result is 
reasonable in light of the fact that reading learning 
is complicated by a demandfor the understanding of 
the meanings of words. Similarly, diminished re- 
lationships of frequency and word length to reading 
difficulty as opposed to spelling difficulty attest to 
complications in the reading learning process that 
arise from the need for understanding. 4 

The literature suggested conflicting results in 
terms of the relationship of positiveness or nega- 
tiveness and the use ог learning of words. By and 
large, positiveness or negativeness appear to be 
factors in word selection or error where the re- 
sponse is not dictated from outside the individual. In 
the reading and spelling learning situation, even 
though the words may have first been generated by 
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the children themselves, the children по1опрег have 
a selective control over the words. The words, 
then, become tasks; in spelling, thetask is to name 
or write the letters in appropriate sequence, and in 
reading, the task is to name the word and to give a 
meaning. It is obvious that the major reinforce- 
ments for these tasks do not c o me from the mean- 
ings of the words, but rather from the successful 
completion of the task. The connotative meaning, 
then, is of relatively little importance inthe learn- 
ing situation. This is not to suggest that emotion- 
ally positive or emotionally intense passages will 
not be remembered longer or used more frequently. 
On the contrary, evidence from other studies indi- 
cates that material with emotional impact for an 
individual is more likely to be used byhim. The 
study does suggest, however, that the learning of 
emotionally-loaded material within the usual limit 
of the school situation will not be mollified by the 
emotional impact of the words. 

The present study seemed to bear out Zipf’s con- 
tention (13) relating emotional intensity to frequency 
of occurrence and to word length. The relation- 
ships were small. However, the present sample 
was drawn from a population of 5, 500 of the most 
frequent words, and if Zipf is correct, the range of 
intensity has been sharply limited by this selection. 
It would appear that the frequency intensity rela- 
tionship might be increased if a full range of fre- 
quency were used. Further investigation is neces- 
sary to answer this particular question. It is suf- 
ficient to say that intensity per se is not a variable 
in learning difficulty within the presentlimits com- 
monly employed in the schools. 

A further question is raised by Hull’s delibera- 
tion (6) concluding that the slope of the generaliza- 
tion gradient is decreased as a funct ion of apro- 
gression from weaker tomore intense stimuli. 
This position would argue for an organization of 
stimuli giving prior presentation to stimuli of great- 
est intensity. Such an organization might be possi- 
ble, using word length rather than frequency asa 
principle for the sequential organization of difficul- 
ty. It would appear, however, that the prepotence 
of frequency as a difficulty variable would still give 
inverted results to such an organization of stimuli. 


Summary and Conclusions 


The review of the literature on the effects of 
connotative meaning on word difficulty indicates that 
the emotional positiveness and negativeness of the 
meaning of words is a factor in their difficulty only 
in a free selection situation. Emotional intensity 
appears to interact with frequency of occurrence in 
a negative fashion such that more intense words are 
less frequent, hence more difficult. The present 
study related the difficulty of learning to spell and 
read a sample of 149common words by school 
children to the judged connotative positiveness-neg- 
ativeness, the intensity of the meanings, frequency 


of occurrence, and the length of the word. 

Both frequency of occurrence and word length 
were significantly related to learning difficulty of 
words. Children tended to spell and read more 
readily words of positive connotation, however, this 
was a result of the relation between positive conno- 
tation to frequency of occurrence. Similarly, more 


intense words were slightly more difficult, but this ` 


relation was a function of the relationship of in- 
creased intensity to longer andless frequent words, 
Within the limits of the sample of words taught in 
School, then, learning difficulty of reading and 
spelling words bears no relation to either connota- 
tive emotional tone or emotional intensity of the 
word. 
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The Oral Reading Performance of a Second Grade Class 


Using an Informal Reading Test 


THE USE OF the informal reading inventory has 
been recommended as a clinical diagnostic proce- 
dure and as a regular classroom teaching technique 
(1, 2, 4, 6, 8, 16). Socher (12) reported thata 
Study using an informal reading inventory with upper 
elementary school children is currently being con- 
ducted at Temple University. No research during 
the period 1930-1959 using the informal reading in- 
ventory as here proposed could be found by the au- 
thor in the professionalliterature (13, 14, 15,5). 
Sheldon and Hatch (9, 10, 11)and Hatch and Sheldon 
(7) compared the individual reading performances of 
groups of good and poor readers in grades 2, 3, 4, 
and 6 as measured by the Durrell Analysis of Read- 
ing Difficulty. They found that: 

a. Word by word reading was a common charac- 
teristic in good and poor readers in second grade, 
ànd а common characteristic of only poor readers 
in grades 3, 4, and 6. 

b. Poor or scanty unaided recall was common to 
both good and poor readers after oral and silent 
reading. 

с. At second grade level, both good and poor 
readers tend to ignore punctuation. 

. Q. At the second gradelevel the error occurring 
in the greatest number of cases was the substitution 
of whole words which were similar in form. 

€. Low sight vocabulary was a common charac- 
teristic of poor readers in grades 2, 3, and 4. Six- 
ty-eight percent of second grade pupils, 81% of third 
grade pupils, and 68% of fourth grade pupils classi- 
fied as poor readers were reported as having low 
Sight vocabularies. No percentages were given for 
the good readers. Characteristics were reported 
only when 50% of a group had the characteristic. 
$ Durrell (3) reports that the average reading rates 
‘or children examined at the Boston University Clin- 
10 were 45 words per minute for first grade, 80 
Words per minute for second grade, and 110 words 
Per minute for third grade. He does not report how 

18 was measured. 
A The purpose of this study was to record the oral 
And performances ofthirty-six second grade 
dren in reading materials below grade level, at 
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grade level, and above grade level, and to compare 
the oral performances of the good, average, and 
poor readers. 


The Subjects 


Thirty-six pupils were tested by one exam iner. 
АП thirty-six children were in the sam e second 
grade class. Two pupils were absent and were not 
tested. The class was one of two second grades 
housed in the same building. The classes had been 
formed by dividing the children by chronological 
age. The class tested had the older pupils. With 
three exceptions, their chronological ages were 
from 7 years 10 months to 8 years and 5 months. 
The other three children were 8-6, 8-8, and 9-2. 
The testing was done in May, two weeks before 
School closed. Intelligence test scores were not a- 
vailable. However, no special class grouping had 
eliminated either high or low I.Q. children from 
this class. 


The Test and Administrative Procedures 


The test administered was four graded para- 
graphs ranging in length from 39 to 77 words. The 
first paragraph was primer level, the second be- 
ginning second reader level (2-1), thethird high 
second reader level (2-2), and the fourth beginning 
third reader level (3-1). The paragraphs are part 
of the diagnostic reading tests found in the Teach- 
er's Manual of the Sheldon Basic Readers (8). 

The test was administered individually. Each 
pupil read each paragraph orally, andthen told what 
had been read. The time for oral reading was re- 
corded, the kinds of oral errors were recorded, 
and the comprehension was recorded. The pupils 
read the primer paragraph first, reading the other 
paragraphs in the order of their difficulty. Oral 
reading errors were recorded as repetitions, sub- 
stitutions, additions, pronunciations, omissions, 
punctuation, and mumbling or mispronunciation. 

The classroom teacher divided the pupils into 
three groups, good readers, average readers, and 
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poor readers. Using the procedures described by 
Betts (1) for determining basal, instructional, and 
frustration levels of reading performance, the ex- 
aminer divided the pupils into good, average, and 
poor reader groups. The ratings were done inde- 
pendently. 

Errors were considered qualitatively and quan- 
titatively. A qualitative error was counted when- 
ever one of the seven errors occurred. A quantita- 
tave error was counted whenever a new mistake 
was made. For example, if a child omitted a word 
and corrected the omission by repeating a phras e, 
one quantitative and two qualitative errors occurr- 
ed, an omission and a repetition. 


The Results 


The reader groups were almost identical. One 
pupil rated average by the examiner was rated good 
by the teacher. The examiner rating was based up- 
on the pupil's poor comprehension. The pupil had 
read fluently with only one oral error in each para- 
graph. The teacher felt that the comprehension was 
atypical. In tabulating the results, this pupil was 
rated as good. 

Nine pupils were judged good readers, seventeen 
pupils were judged average readers, and ten pupils 
were judged poor readers. One poor reader was 
unable to perform, and is not included inthe tabula- 
tion. Іп tabulating the results, one pupil, a poor 
reader, was unwilling to attempt the 3-1 paragraph. 
His performance is included in the tabulations for 
the first three paragraphs. 

Percentage ratings are used in many ofthe cate- 
gories. Since the groups are not the same size, 
this makes comparison easier. Percentages are 
rounded to the nearest whole number. 

The performance of the pupils inthe three areas, 
speed, comprehension and errors, indicated differ- 
ences in performance on each gradelevel of reading 
difficulty. The good readers tended to be superior 
to both the average and poor readers inall three 
areas. The average readers tended to be better 
than the poor readers except in comprehension. 
This might be explained by the fact that the poor 
readers waited for difficult words to be pronounced, 
while the average readers struggled through them 
without help. 

The speed performances are tabulated in Table 

I, the comprehension results are given in Table II, 
the numbers of qualitative and quantitative errors 
are listed in Table Ш, the errors per onehundred 
running words per child are listed in Table IV, and 
the number and percentage of qualitative errors are 
listed in Table V. 

The comprehension percentage is low for the 
good readers at the 2-2 and 3-1 levels because one 
child blocked after reading, and scored less than 
20%; and because a different child blocked at 2-1 
and 2-2 levels. 

With the exception of the good readers in going 
from primer to second readerlevel materials, each 
group made a greater number of both quantitative 


and qualitative errors at each succeeding grade 
level. There seems to be no difference in the num- 
ber of errors made on the two second level para- 
graphs. However, the word ‘‘boots,’’ which ap- 
pears in the 2-11evel paragraph was miscalled 
**boats"' by all but one child. This mi s calling and 
subsequent correction may ha v e created an artifi- 
cial equality. 

There is some indication that good readers need 
less help with pronouncing than other readers, and 
that average readers need less helpthan poor read- 
ers. In observing this difference, and all the dif- 
ferences which may seem to exist, it is important 
to realize that the differences may be a function of 
the level of the testing materials. For example, 
the percentage of word recognition errors (pronun- 
ciation) would be expected to increase for the good 
reader group if the material used to test that group 
were as difficult, comparatively, forthe good read- 
ers as the testing materials were for the poor read- 
ers. 

The results seem to indicate that oral reading 
errors may not be discriminating, that all children 
make similar errors, and that oral reading per- 
formance (not including comprehension) is a general 
Skill or a composite of highly correlated factors 
rather than a composite of separate skills. The 
number of errors a child makes when reading orally 
is important; the kind of errors he makes may not 
be important. It is possible that analyses of speci- 
fic errors mislead teachers into fragmented teach- 
ing. This may be unnecessary or detrimental. The 
level established for teaching may be much more 
important than the error pattern. 

Comprehension seems to vary without regard to 
the number of oral reading errors, and needs to be 
considered as a separate skill. Fluentoral reading 
does not insure high comprehension, nor does poor 
oral reading mean poor comprehension. Itwould 
Seem, generally speaking, that the rate of oral 
reading probably should not be considered as a sep- 
arate skill. 


Summary 


For these children (a) either quantitative or 
qualitative errors seemed to have equal discrimin- 
atory value in determining their oral reading à- 
chievement; (b) qualitative errors did not seem 10 
discriminate between average and poor readers, al- 
though they may have some value in discriminating 
between good readers andothers; (c) comprehension 
seemed to discriminate between good readers and 
others, but not between poor and average readers; 
(d) speed of oral reading seemed to discriminate a- 
mong good, average and poor reader groups; (е) 
wide ranges of individual performance were observ- 
ed in all three areas—speed, co m prehension, and 
errors—with degrees of overlapping between touch- 
ing groups in all areas. There was some overlap- 
ping in comprehension and errors among the three 
groups. There was по overlapping between 600 


McCRACKEN 115 


TABLE I 


SPEED OF READING IN WORDS PER MINUTE (RANGES IN SECONDS IN PARENTHESIS) 


раш aE OLLLCÓLLAAALÉLLLLAG ODLAROAGOOUOULiGIALNGHILLOLLLOL íS.Ó O)OCOGgGST :PAGM 


Reader-Pupil Classification Levels of Material 
(Number in parenthesis) E РЕДАК ESO ТСМ ры E др: 
Good (9) 151 (16-30) 117 (16-25) 127 (17-37) 112 (30-62) 
Average (17) 101 (17-45) 81 (17-47) 88 (22-56) 62 (38-102) 
Poor (9) 57 (38-92) 43 (38-68) 49 (42-117) 34* (106-173) 
Total 91 72 75 62 


* One pupil did not read at this level 
TABLE II 


AVERAGE COMPREHENSION PERCENTAGES AT EACH LEVEL. (IN PARENTHESIS, THE 
RAW SCORE RANGES IN NUMBER OF RESPONSES) 


eS 


Reader-Pupil Classification Levels of Material 
(Number in parenthesis) Primer 2-1 2-2 3-1 
(10 responses (13 responses (17 responses (18 responses 
possible) possible) possible) possible) 
Good (9) 70% (4-10) 84% (2-13 62% (3-14) 64% (3-14) 
Average (17) 56% (0-10) 69% (2-13) 59% (6-14) 51% (5-15) 
Poor (9) 62% (5-8) 75% (1-12) 66% (6-13) 56%* (5-15) 
Total 61% 12% 62% 56% 
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* One pupil did not read at this level 
TABLE III 


TOTAL NUMBER OF ORAL READING ERRORS 


"Quantitative errors 


(Individual child's range raw scores in parens.) Qualitative errors 
Primer = = = Tot Primer 2-1 2-2 3-1 Total 
Be у, 5 SBS E E ee а салла Шш кас шшш ш 
9 Good Readers 11 (0-3) 11 (0-4) 13 (0-3) 32 (1-8) 67 15 Ts ҮДА 99 88 
17 Average Readers 44 (0-7) 67 (1-8) 67 (1-8) 152 (2-15) 330 61 90 85 218 454 
9 Poor Readers 57 (1-15) 76 (4-14) 88 (6-16)187*(11-34) 408 80 112 116 264 572 
Total 112 154 168 371 805 156 219 218 521 1114 


* One child was unable to perform at this level 


TABLE IV 


ERRORS PER 100 RUNNING WORDS PER CHILD (PERCENTAGES PER LEVEL) 


2.3 (12%) 3.1 (9%) 2.8 (9%) 4. 6 (10%) 3.1 (12%) 4.7 (9%) 3.7 (10%) 5.6 (9%) 
Average 4. 9 (26%) 10.1 (29%) 7.7 (26%) 12.4 (26%) 6. 9 (26%) 13.6 (27%) 10.4 (28%) 16.7 (26%) 
Роог 12.0 (63%) 21.7 (62%) 19.2 (65%) 30.4 (64%) 16.8 (63%) 31.9 (63%) 22.7 (62%) 42.8 (66%) 
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TABLE V 


NUMBER AND PERCENTAGE FOR EACH READER GROUP OF QUALITATIVE ERRORS 


Repetitions Substitutions Additions Pronunciations Omissions Punctuation Mumbling and Total 
Mispronunciations 
9 Good readers 33 (38%*) 19 (22%) 17 (19%) 9 (10%) 7 (8%) 0 (0%) 3 (3%) 88 (1004) 
17 Average readers 119 (26%) 113 (25%) 50 (11%) 127 (28%) 36 (8%) 8 (2%) 1 454 (100%) 
9 Poor Readers 138 (24%) 111 (19%) 65 (11%) 219 (3895) 30 (5%) 5 (1%) 4 (1%) 572 (100%) 
Total 290 (26%) 243 (22%) 132 (12%) 355 (32%) 73 (7%) 13 (1%) 8 (1%) 1114 
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* Percentages refer to each reader group, not to the type of error. 
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and poor reader groups in speed; (f) word recogni- 
tion difficulty seemed to be the major problem for 
the average and poor readers; (g) comprehension 
seemed to be low. It seemed that the free response 
comprehension was difficult for these children. 
This would corroborate the findings of Sheldon and 
Hatch (9, 10, 11) and Hatch and Sheldon (7). 
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An Evaluation of Course Methods in Teaching Child Development 


ALTHOUGH MANY investigations have studied 
the attitudes which parents andteachers hold toward 
children, little attention has been given to the pro- 
cess of formation and development of these attitudes. 
The effectiveness of courses on Child Development 
in programs of parent education and teacher educa- 
tion are generally not evaluated in terms of attitud- 
inal change or on the basis of children's adjustment. 

There was a twofold purpose in conducting this 
study: first, to compare the relative efficacy of 
three different methods of colle ge teaching, and 
secondly, to determine whether students’ attitudes 
toward children are measurably incluenc ed bya 
course in child development. Similar research has 
been conducted by McDougall (9) who investi gated 
the changes of students' opinion associated with dif- 
ferent methods of teachingIntroductory Psychology. 
Birney and McKeachie (2) have presented a review 
of the research in this area; and in a further report 
(1) state that **attempts to measure a Supposed su- 
periority of student-centered group discussion 
classes over an instructor-c entered cl ass have 
failed. 

Egner (6) points out that new criteria are needed 
to evaluate education. Hes uggests that students' 
reactions and changes of behavior, whichcan be at- 
tributed to the learning experience, be used as cri- 
teria. Studies by Costin (4) and Hurley and Laffey 
(8) present evidence that courses in child psychology 
influence students toward more favorable attitudes 
toward children. A recent Study by Freedham and 
Hackett (7) evaluated changes in attitudes toward 
children resulting from a course on the pre-school 
child. 


Method 


There were a total of 145 students in the study 
which made up three sections of a course in Child 
Growth and Development in the School of Education 
at U.C.L. A. The majority of these students, 63%, 
were undergraduates in elementary education, and 
18% were in nursing education. Minor numbers 
were in secondary education, three percent, and in 
graduate school, eight percent. Experienced teach- 
rs and nurses comprised nine ре rcent, and 13% 
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were also parents. Although the students were not 
randomly assigned to the sections init ially, there 
was no apparent source of variation between the 
sections. They were representative of the popula- 
tion of students in elementary education at this in- 
stitution. 

The sections of the course were taught utilizing 
three different instructional methods: 1) lecture, 
2) case-centered, and 3) group-centered. The 
same texts (10, 11) were required in each section, 
but other course requirements and class meetings 
were varied to suit the method. Thelectures for 
Section I were supplementary to the assigned read- 
ing. These students were also required to write 
reports on topics of physical, social or personal 
development. The emphasis in the lectures and re- 
ports for this section was on research and theoret- 
ical information. 

The case-centered approach utilized class dis- 
cussion of case studies and behavior observations. 
Each student observed a child at the University El- 
ementary School for the duration of the semester. 
The written requirements included four reports of 
behavior observations which were processed ac 
cording to the child Study procedures as outlinedin 
Prescott (11). The instructor servedasa resource 
person to supplement the child study approach. 

In the group-centered section, three sub-groups 
of fourteen students each were established on the 
basis of pretest scores on the Minnesota Teacher 
Attitude Inventory (3). The seven students who ob- 
tained the highest scores, and the seven students 
who obtained the lowest scores were assigned to one 
group. This combination provided a maximum 
range of attitudinal difference within one sub-group. 
The second sub-group was com posed of the next 
Seven from the upper andsevenfrom the lower ends 
of the distribution, that is, from the intermediate 
ranges. The third group was composed of the four- 
teen students from the middle of the distribution, i. 
e., with a minimum range of difference. Discus- 
sion leaders were provided for each of these groups. 
The students in the sub-groups planned their own 
discussions, field trips, and reviews. 

The experiment did not attempt to contrast stu- 
dent-centered vs. instructor-centered classes, di- 


rective vs. non-directive, northeoretical vs. prac- 
tical approaches. In each of the sections, latitude 
was given to the students as to the choice of cases, 
topics or problems on which they wished to report. 
Although the methods may have, inherent in them, 
different exphasis on theory, practice, res earch, 
and group direction, no special attem pt was made 
to control these factors. The major purpose of the 
experiment was to appraise the result of course and 
method on attitudes and achievement. 


Evaluation 


The Minnesota Teacher Attitude Inventory and 
Shoben's Parent Attitude Survey (12) were adminis- 
tered to all students as pretests.* They were asked 
to respond to the questions in a manner which would 
indicate their present attitudes as hypothetical par- 
ents, (PAS) and prospective teachers (MTAI). The 
majority of the items on the MTAI are essentially 
attitudes toward children (118 of 150). There was 
no pretest of their knowledge of Child Development. 
Prerequisites forthe course included a course in 
introductory psychology and physiological psychol- 
ору or mental hygiene. There were no known fac- 
tors of selection or bias, so the groups were as- 
sumed to be comparable in ability and pre-course 
achievement. Тһе final evaluation utilized scores 
on the mid-term and final examinations, and post- 
tests on the above attitude scales. 


Analysis of the Data 


The scores on the mid-term and final examina- 
tions were combined into one achiev ement score. 
The mean scores for each of the sections were as 
follows: lecture 186. 4, case-centered 181.8, and 
group-centered 181.1. The summary statistics for 
achievement scores and the pre-test and post-test 
attitude scores are shown in Table I. The achieve- 
ments for the sections were compared, utilizing 
analysis of variance. The data for this analysis 
are shown in Table II. 

The results of the analysis indicates that there 
was no significant variationbetween the sections on 
achievement. Тһе course outcomes, as measured 
by mid-term and final tests, were not significantly 
influenced by any of the course methods under test. 
It was concluded that theattainment of knowledge a- 
bout child development occurred as a result of en- 
rollment in the course and was independent of the 
Instructional method. 

The pre-test means on the MTAI and PAS were 
compared and the initial variation among the groups 
9n these measures was found to be non-significant. 
The correlation between the pretest distributions 
9n the PAS and the pretest MTAI was -. 306; andthe 
Correlation between the post-test distributions was 
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-.307. The negative relationship is appropriate 
since favorable attitudes are indicated by high 
scores on the MTAI and by low scores on the PAS; 
and poor attitudes conversely. The correlation be- 
tween the instruments is significant atthe . 01 level. 

The next analyses consider the difference be- 
tween pre- and post-test attitude scoresfor eachin- 
dividual student. Positive change, or improvement 
in attitude, was shown by an increase in the post- 
test score of the MTAI and byadecrease in the 
post-test score of the PAS; and negative changes by 
the opposite differences. Each student, therefore, 
received two change scores. Themean change 
which occurred between the MTAI pretest and post- 
test for all sections combined was 11.92, and the 
mean change on the PAS for all sections was -13. 95. 
The changes on the MTAI observed in each of the 
sections аге as follows: lecture 11.2, case-center- 
ed 9.7, group-centered 14.3. Changes on the PAS 
were 10.8, 17.0, and 15.3 for the corresponding 
groups. Inall sections, the directionof change was 
toward more favorable attitudes. 

The “Ё” test was used totest for the significance 
of differences between means of individual pre- and 
post-test scores. Values of 6.27 and 7.34 were ob- 
tained for differences on the MTAI and the PAS re- 
spectively. Both of these ‘‘t’’ values are significant 
at the .001 level, indicating that positive improve- 
ments in the students’ attitudes toward children had 
occurred during the course in Child Development. 

The change scores on the MTAI were correlated 
with the change scores onthe PAS. The correlation 
coefficient of +.23, not corrected for attenuation, 
was significant at the . 01 level. Thus, the differ- 
ence which occurred between the pre- and post-test 
on the MTAI were associated with the differences 
which appeared on the PAS. 

These change scores were then combined alge- 
braically into one composite score, This composite 
score represented the positive and/or negative 
changes in attitudes for each student. The means 
for the composite change for each of the sections 
were as follows: lecture 12.3, case-centered 29.4, 
and group-centered 29.6. The correlations between 
the achievement scores and the change scores for 
all sections combined was then computed. Тһе co- 
efficient .29 is significant at the one percent level 
for this sample size. It was concluded that course 
achievement was significantly related to attitudinal 
change. 

The next analysis was to determine whether the 
attitudinal changes could be attributed to course 
methods. The analysis of variance problemis 
shown in Table III. The findings indicate that there 
was no significant variation between the groups 
which could be attributed to the teaching method. 
Thus, although there were significant positive 
changes in attitudes toward children resulting from 


* Hereafter in this article, these attitude inventories will be referred to as the MTAI and the PAS respectively. 
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TABLE I 


SUMMARY STATISTICS ON ACHIEVEMENT AND ATTITUDE SCORES* 
=m 


Lecture Case-Centered Group-Centered 
Achievement Means 186.4 181.8 181.1 
S.D.'s 16.9 20.3 17.2 
MTAI Pretest Means 133.7 141.2 142.4 
S.D.'s 30.4 26.9 32.5 
PAS Pretest Means 113.3 123.8 120.8 
S.D.'s 26.9 22.9 29.0 
MTAI Post-test Means 144.1 152.2 155.9 
S.D.'s 31.5 13.4 28.9 
PAS Post-test Means 110.2 107.8 107.6 
S.D.'s 18.0 19.6 21.8 


* For ease in computation procedures, a constant of 100 was added to the MTAI 
raw scores and a constant of 200 was subtracted from PAS raw scores. 
TABLE II 
ANALYSIS OF VARIANCE ON ACHIEVEMENT 


Source of Variation d. f. Sums of Squares Mean Square F Ratio 
eese RN :. 
Method 2 814. 6 407.3 1.2 
Within-group 142 48138. 6 339.0 hypothesis 
accepted 
Total 144 48953. 2 


TABLE III 
ANALYSIS OF VARIANCE ON ATTITUDINAL CHANGE 


= ар 


Suurce of Variation d.f. Sums of Squares Mean Square F Ratio 
Method 2 9650. 7 4825.4 1.69 
Within-group 142 405255. 3 2853.9 hypothesis 
accepted 
Total 144 414906. 0 


= se В.Е 


| 


—-—-——— 


121 


TABLE IV 
MEAN CHANGE SCORES ON MTAI IN GROUP-CENTERED SEC TION 


Maximum Range 


Intermediate Range 


Minimum Range 


Pretest Lower (21) 24.9 
Pretest Upper (21) .14 
Total (42) 12.5 


the course, no one method was superior to any oth- 
'er method in producing these changes. 

The change scores for the MTAI in the sub-groups 
of the section in which the group-centered approach 
was employed were thenanalyzed. Themeanchange 
for the entire section was +14.3. The sub-group 
means are shown in Table IV. The comparison of 
changes which occurred in the sub-groups indicated 
that they do not differ from each other inthe amount 
of attitudinal change. 


Findings of the Study 


1. Achievement in the course in Child Develop- 
ment as measured by course tests is not significant- 
ly related to teaching method. Factorssuchas mo- 
tivation and competition may have exerteda stronger 
common influence toward achievement than the var- 
lation in teaching method. 

2. Favorable changes in attitudes toward child- 


' ren were a result of the course experiences. Mean 


differences between pre-test and post-test scores 
Оп the Minnesota Teacher Attitude Inventory and the 
Parent Attitude Survey were significant. 

3. The two attitude instruments used in this 
Study were significantly related to each other. This 
Correlation tends to support their concurrent valid- 
ity for measuring attitudes toward children. 

4. Change scores on attitudinal measures were 
Significantly correlated with achievement scores. 
Those students who attained the highest achievement 
Scores made the greatest gains in attitudes toward 
children, 

9. The change between individual pre- and post- 


lest scores on the Teacher Attitude Inventory were 
Significantly related to the change scores on the 


Parent Attitude Survey. 

ТА Although positive attitudinal change wasa 
te course outcome for all sections, no one 

method of teaching proved superior to the other 

Methods in producing this change. 

mi The control of heterogeneity and homogeneity 

ai initial attitudes of members in discussion 

2 “groups had no significant influence on the sub- 

equent change in attitudes. 


27.4 22.0 
3.6 6.4 
15.5 14.2 


Implications for Further Research 


There are several experimental hypotheses which 
could not be adequately tested in this study and 
which should be investigated in further research. 
One set of hypotheses could be stated as follows: 1) 
Maximum growth in attitudinal change is attained 
when an intermediate variation in initial attitudes 
exists among the members of a group, and 2) Mini- 
mum change occurs when there is a maximum var- 
iation between group members in their original at- 
titudes. Research on grouping practices in educa- 
tion have previously beenlimited to the variations 
in ability and achievement. The optimum grouping 
on personal and social variables, relative to com- 
munication, productivity, and non-intellectual a- 
chievement has not been determined by research in 
social psychology, group therapy, or in education. 
If attitudes and ideals are regarded as educational 
objectives than perhaps more research is needed to 
determine the methods, conditions, and procedures 
capable of producing such outcomes. 

Teachers' attitudes have sucha critical influence 
in the social and emotional development of children 
that it seems important to take them into greater 
account during their professionaltraining. Profes- 
sional courses, however, are not the only deter- 
minant of such attitudes. Other determinants might 
include: 1) the experiences of the teacher as a pupil, 
2) the basic personality traits of the teacher, and 3) 
the status which pupils have in school and in the 
home. Research is suggested on these possible de- 
terminants as well as on the specific nature of 
teachers' attitudes. 
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| Upward Mobility and |. Q. Performance 


THE OBJEC TIVE of this study isfurtherto elab- 
orate on the lead taken by Davis (1), Havighurst, 
(3, 4, 5), and Eels (2) whenthey enumerated certain 
socio-psychological factors of social class and the 
relationshi p of these to а youngster's performance 
оп ап 1.0. test. The point of view adopted here is 
that the mental аре” of a child is determined by а 
complex interaction between innate and environmen- 
tal factors and that both of these factors are mir- 
rored either directly or indirectly in the social psy- 
chological dynamics of the family. The attempt be- 
ing made here, then, is not to dictate which is of 
greater importance in determining the performance 
of a child on an I. Q. test, heredity or environment, 
but to find associations between a child's I. ©. score 
and variables of the social milieu to which he is ex- 
posed. 

The principal independent variable of interest in 
this study is a family's social mobility. Because of 
the present importance placed on f o r mal education 
in the occupational and social hierarchy of our 
country, it is reasonable to assert thatthose famil- 
les which are upwardly mobile should be exerting 
a fair amount of pressure ontheir children to a- 
Chieve a high level of competence in s c hool activi- 
ties. It is also quite reasonable to assert that typ- 
ically academic skills such as reading, the manip- 
ulation of numbers, and creative thinking will be 
More greatly emphasized and reinforced by these 
family members than by those families either main- 
taining the social class level of their parents or on 
the ‘way down’? in the society. 

The second objective is to find additional vari- 
ables which relate to the performance of a young- 
Ster on an I. ©. test such as the occupational type 
(head or hands) of the father, and the formal educa- 
tion Of the parents. The expectations here, follow- 
ing a similar line of reasoning as above, would be 
2 find a high degree of association between amount 
9f formal education and occupational type on the 
Part of the father; and anassociation between pater- 
nal occupation and/or education, and thechild’s 


p 0 | LARS, зоо HE RM MM 
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performance on an I. Q. test. 


Hypotheses 


А. Children from upwardly mobile families will 
tend to score higher on I. Q. tests thanchildren 
from non-upwardly mobile families. 

B. Children of fathers with a‘‘head’’ occupation 
will tend to score higher on I. Q. tests than children 
of fathers with a ‘‘hands’’ occupation. 

C. Children of parents with more than a twelfth 
grade education will tend to score higher on I.Q. 
tests than children of parents with а twelfth grade 
education or less. 


Procedure 


Measurement of social mobility.1 Тһе follow- 
ing three variables are used in determining the ex- 
tent and direction of mobility for each family studied. 
Included under each variable is a description of the 
distribution of points for measurement. Each vari- 
able may have a maximum of plus two and a mini- 
mum of minus two, making the possible range of 
mobility scores plus six to minus six. 

1. Ecological movement: 

a. Parent has moved from a nearby town 
rated as lower in average socio-economic 
status when compared to present dwelling a- 
rea, 41.0? (This might include: 
1) Parent born in a foreign country and 
migrates to America. 
2) Parent born in the deep south and mi- 
grates north. 
3) Parent has moved from a sm all rural 
town to an industrial suburb. 
4) Parent has moved from alarge urban 
city to suburbia. 
b. Parent has moved from a nearby town 
rated as higher in average socio-economic 
status. -1.0 
c. Parent has always lived in present dwell- 
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ing area. 0.) 

2. Occupational Status: 
a. If male parent's occupation is ‘‘head’’ and 
his father's was “һапбв”, +2. ° 
b. If just the opposite conditionis true, -2. 
c. If male parent's occupationis inthe same 
category as his father's, 0. 

3. Formal education: 
a. If male parent has four more years than 
his father, «1.0. 
b. If male parent has one, two, or three 
years more than his father, +. 5. 
c. If male parent has the same as his father, 


d. If male parent has less than his father, - 1. 
e. If female parent has four more years than 
her mother, «1.0. 

1. If female parent has one, two or three 
years more than her mother, +.5. 

g. If female parent has the same as her 
mother, 0. 

h. If. female parent hasless than her moth- 
ег, -1, 

i. If both male parent and his father have 
16 years or more, «1.0. 

j. If both female parent and her mother have 
16 years ог more, «1.0. 


The group studied. Children of the fourth grade 
from a grammar school in West Dearborn, Michi- 


gan (suburb of Detroit) and their parents were sub- 
jects for the study. Kuhlmann-Finch I. Q. scores 
were obtained for each ofthe 90 pupils in the study. 
Each child took home a questionnaire which was to 
be filled out by both of his parents. The question- 
naires were to be returned in three days. 

Because of the nature of the group, it will be 
impossible to generalize beyond it. Therefore, the 
word “ехріогаіогу” is used herein its fullest sense 
for describing this study even though the experi- 
menters are fully cognizant of the enigma which 
presently plagues it. On theother hand, even though 
the investigators realize the inferiority involved in 
the sampling technique used and that this sample of 
respondents is at the same time the populationto 
which any generalizations must be made , they 
whole-heartedly feel that any hypotheses which get 
reinforced on this population deserve to be tested in 
a larger survey with sound sampling techniques. 


Results 


Dwelling area of the group. Тһе section of Dear- 
born from which this group was drawnis a prosper- 
ous, middle class settlement. In comparison to 
sections of Dearborn to the north and northwest of 
it, it is lower in socio-economic status; but, when 
compared to areas of neighboring villages, it is, 
generally speaking, higher in socio-economic status 
because it is void of lower class dwelling areas and 
slum areas. Warner's description of Dwelling Ar- 


ea No. 3 best describes this dwelling area, 


“A little above average in social reputation and 
to the eye of the scientific observer. This area has 
nice but not pretentious houses. The streets are 
kept clean and the houses are well cared for, It is 
known as a “пісе place tolive' but **society doesn’t 
live һеге”,” 

This section of Dearborn, then, is made up of 
families climbing the “ladder of social mobility", 
It is at most only 15 years old as a prosperous resi- 
dential neighborhood. Itis made up of Christian 
Caucasians who have come to Michigan or who һауе 
come down from upper Michigan to reap the mater- 
ial benefits of industrial Detroit. 

Description of the group. Ofthe questionnaires 
sent out, percent were returned. Of those re- 
turned, 90 percent were completed leaving 72 per- 
cent of the total ready for analysis. 

Only three parents in the entire set of 130 par- 
ents answering questions were born and raised in 


Dearborn. Ninety-seven and seven-tenths percent 
of the parents are movers". The most popular 
areas from which these people came are 1) Detroit, 
2) upper Michigan, 3) regions of the ddep south and 
4) foreign countries such as Canada, Scotland, and 
England. 

Every family reported at least one car and there 
is a preponderance of Fords as might be expected 
since the Ford Motor Company is located іп Dear- 
born. Virtually all (97 percent) of the families are 
homeowners with a mortgage. Themean number of 
children for a family is 3.1. The median income of 
the sample families is slightly over $6, 000 per year. 
Each of the families owns a television set and most 
of them watch it regularly. The mean number of 
hours the fourth-grade youngster in each family 
watches television is 17.5 per week. Few parents 
find time to read anything but the newspaper and a 
few magazines. The most popular magazines listed 
are Readers Digest (listed most often), Ladies 
Home Journal, Life, and Time (the most popular 
news magazine). " ЕЖЕ! 

High апа low І. Ө. groups. Each of the pupils in 
the study has a score on the Kuhlmann-Finch Test 
of Intelligence. These scores range from 84 0 
with the median at 112. Since the standard error of 
measurement for the test is approximately 5, it was 
decided not to dicotomize at the median in order to 
avoid overlap in the two groups. The high group i$ 
made up of pupils scoring atthe 60th percentile and 
above (115-137) while thelow group is made up of 
pupils scoring at the 40th percentile and below (110 
-84). Each group is made up of 26 pupils making 2 
total of 52 in the analysis of the data. j 

Social Mobility Hypothesis. Since this group 
parents is homogeneously upwardly mobile, the 0- 
riginal hypothesis is changed to read: Children 
from “‘distinctly”’ upwardly mobile families Vi 
tend to score higher on I.Q. tests than children 
from ‘moderately or negligibly’? up war dly mobile 
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TABLE I 


RELATIONSHIP BETWEEN A FAMILY'S SOCIAL MOBILITY 
AND THE PERFORMANCE OF THAT FAMILY'S FOURTH- 
GRADE CHILD ON ANI. Q. TEST 


TAM SSS 


High I. Q. Low 1.0. 
——————— MÀ EE CM NEN on or o ier sadi 
“Distinctly” Upwardly Mobile 19 6 


X'-11.1 P< .001 
*'Negligibly" Upwardly Mobile 7 20 (1 d.f., corrected) 


S 


TABLE II 
l 


RELATIONSHIP BETWEEN PATERNAL OCCUPATIONAL GROUP 
AND PERFORMANCE OF CHILD ON AN I.Q. TEST 


к_———Є—Є—Є—Є—Є—Є—Є—Є—Є—Є——Єї————Є—Є—Є&Є{&—Є——Є—Є—Є—Єїї———————Є 


High I. Q. Low 1.0. 
**Hands?' 13 22 
X? = 5.59°.02>p>. 01 
“Неаа”’ 13 4 (1d.f., corrected) 
TABLE III 


RELATIONSHIP BETWEEN PATERNAL FORMAL EDUCATION AND 
PERFORMANCE OF CHILD ON AN I.Q. TEST 


ee 


High I. Q. Low I. Q. 
12 years or less 16 23 
4 X*-3.69 .10>р>.05 
12 years or more 10 3 (1d.f., corrected) 
TABLE IV 


RELATIONSHIP BETWEEN OCCUPATION OF MOTHERS IN “НАМр8” 
FAMILIES AND THE PERFORMANCE OF CHILD ONI.Q. TEST 


e 


High I. Q. Low I. Q. 
HERRERA der 20 DIETE ade e mc YIII TENE Se АННУ 
Mother works 2 13 2 


X -5.94 .02>p>.01 


Mother is homemaker 11 9 (1 4.1., corrected) 


| een 
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families. ‘‘Distinctly’’ upwardly mobile families 
are operationalized as ones receiving above +3 on 
the index while “moderately or negligibly” upward- 
ly mobile families are ones receiving +3 or less on 
the index. The results show a definite association 
between ‘‘distinctly’’ upwardly mobile families and 
high I. Q. children, and between *moderatel yor 
negligibly’’ upwardly mobile parents and low I. Q. 
children. 

The results in Table II show an association be- 
tween a child's I. Q. and his father's Occupational 
type in the expected manner. The results in Table 
III show the association between a child's I. Q. and 
his or her father's formal education. No relation- 
Shipwhatsoever was found between a child's I.Q. 
and the amount of his mother's formal education. 

There is, as would certainly be expected, a very 
significant relationship between a father's occupa- 
tional type and the amount of formal education he 
has had. The extent of this relationship as measur- 
ed by a phi coefficient is a positive . 83. 

Table II shows a probability of . 63 (22 out of 35) 
that a father with a ‘‘hands’’ occupation will have a 
child scoring in the low I.Q. group in this sample. 
This rather close to chance (. 50) result prompted 
the experimenters to search for another variable 
which might further differentiate between the high 
and low 1.0. scorers within the “һала” occupation- 
al group. Table IV shows the results of testing 
whether or not a mother’s working outside ofthe 
home in a **hands" occupational family would bear 
any relationship to a child being in the high or low 
1. Ө. group. 

The salient point about these results is that there 
are only two families in which the father hasa 
*thands"' occupation, the mother Works, and the 
child is in the high I. Q. group. Thus, if itis known 
that a child's father has a “һапбв” occ upation and 
that his mother works, one will be correct 90 per- 
cent of the time by guessing that this child is in the 
low І. ©. group. 


Discussion 
fet ANE 


The differentiation between the ‘distinctly’? up- 
wardly mobile group and the ‘‘moderately or-negli- 
gibly’’ upwardly mobile group is due almost exclu- 
sively to occupational and educational changes since 
97.7 percent of the parents sampled are classified 
as ‘‘positive ecological movers", Those parents 
who don’t seem to change occupationally and educa- 
tionally from their own parents tend to have the low 
I.Q. children, while those parents who do change 
occupationally and educationally from their own 
parents tend to have the high І.0. children. Fa- 
thers who maintain hands occupations tend to have 
the low I. Q. children while fathers who maintain 
head occupations tend to have the highI.Q. children. 
Fathers who have not gone to coll ege tend to have 
the low I. ©. children while fathers who have attend- 
^d some college tend to have the highI.Q. children. 


None of these results should be astound ing for 
anyone who thinks back on the crude proc edures 
Binet used for validating hisfirst intelligence tests, 
He merely went to the particular social groups which 
Seemed to him to be breeding grounds for preco- 
cious youngsters and to other groups which Struck 
him as being the kinds which bred and trained the 
“duller’’ children. He was interested in predicting 
success in school and not some m ystical quality 
hidden in the cerebrum. In essence, the Social 
group which he thought to yield the successful stu- 
dents most often was the professional middle class, 
the head occupational people, the highly educated. 
And, conversely, the “duller” children, the less 
successful students, came from the working class, 
the hands occupational group, the less well educat- 
ed. He then repeatedly changed his test until the 
high and low scores went to children from these two 
social groups respectively. 

It may be well for educational ps y c hologists to 
look more closely at just what they are measuring 
when they administer and score an I. Q. test. 


Summary 


Ап attempt was made to find rel ationships be- 
tween sociological variables and the performance on 
ап 1.0. test of fourth grade pupils in an all caucas- 
ian, all Christian, Detroit suburb. 

Results show a significant rel ationship between 
a family's social mobility and the performance of 
their fourth grade child on the Kuhlmann- Finch Test 
of Intelligence such that “distinctly” upwardly mo- 
bile families tend to have high I. Q. children while 
“negligibly’’ upwardly mobile families tend to have 
the low I. ©. children. Furthermore, the data re- 
vealed that ‘thead’’ occupational fathers tend to have 
high I.Q. children while **hands"' occ u pational fa- 
thers tend to have low LQ. children, and that fa- 
thers with more than twelve years of formal educa- 
tion tend to have high 1.0. children while fathers 
with twelve years or less of formal educ ation tend 
to have low I. Q. children. 

Still more scrutiny of the data revealed the unex- 
pected result that children whose fathers havea 
"thands'* occupation and whose mothers work outside 
the home tend to score in thelowI.Q. group ninety- 
Six percent of the time. 


FOOTNOTES 


Ж Complete addresses: Richard A. Schmuck, In- 
stitute for Social Research, University of 
Michigan; Robert W. Schmuck, School Dis- 
trict Number 8, Dearborn, Michigan. 

1. The universal appropriateness of this index for 
the measurement of social mobility is ques- 
tionable indeed; however, for the particular 
community studied (see section headed'*Re- 
Sults”’), this index is deemed adequate. 

2. The Warner Index on dwelling area is used 254 


4. 


1. 


SCHMUCK 


uide. 

ae dicotomy between ‘‘hands and head’? occu- 
pations used in this study is derived from the 
Warner Index by calling Warner’s 1, 2, 3 cat- 
egories HEAD and categories 4, 5, 6, and 7 
HANDS. 

W. L. Warner, et al., SocialClassin America, 
(Gloucester, Mass: Peter Smith Press, ; 
p. 153. 
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Changing the Attitudes of Medical School Instructors 


THIS STUDY reports attitude changes in a group 
of thirty-two medical school instructors, during an 
intensive ten day medical teaching institute, held at 
the University of Buffalo in the summer of 1959. 
While the reader may react—and with good reason— 
that this is simply another in the endless stream of 
attitude change studies, some of which often border 
on mediocrity, it does claim distinction (perhaps its 
sole claim) in that it involves a highly *'sophisti- 
cated" and experienced group of medical school 
teachers. 

Briefly, this is the setting in which the study 
took place: During June, 1959, the University of 
Buffalo School of Medicine, in collaboration with 
the Association of American Medical Colle ges, 
sponsored the Second Summer Institute on Medical 
Teaching. This institute, like the first, held in 
June, 1958, was under the direction of George E. 
Miller, M.D. and received its financial s upport 
from the Commonwealth Fund of New York. During 
early March, 1959, a brochure announcing the pro- 
posed institute was sent to all Deans andd epart- 
mental chairmen of medical schools in the United 
States and Canada, They were requested to publi- 
cize the institute to all their faculty, and those in- 
terested in attending were to reply directly to Buf- 
falo for an application and any additional informa- 
tion desired. 

After all applicant queries had been answer ed, 
and applications processed, there remained a list 
of approximately 105 medical school instructors 
who desired to attend. Since there were only thir- 
ty-two vacancies, a priority system had to be es- 
tablished that would give the institute a heterogen- 
eous faculty composition within such a highly homo- 
geneous group. It was arbitrarily decided, there- 
fore, to have representation of 1) a proportionate 
number of basic science and clinical faculty, 2) at 

least one member from each of the basic science 
and clinical disciplines, 3) teachers ona part- and 
full-time appointment, 4) all faculty ranks, 5) par- 
ticipants of medical schools from all geographic 
areas, 6) M.D. and Ph.D. degrees, and 7) if pos- 


* Footnotes will be found at the end of the article. 
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sible, two members from each medical school. The 
latter priority was established because the institute 
planners felt that on returning to his own medical 
school, each participant would feel more secure in 
knowing that he had ап “аПу”. Of all the priorities 
only number seven was difficult to adhere to: some 
schools had only one applicant and often that partic- 
ular individual’s qualifications satisfied the other 
five priorities which were considered more essen- 
tial. 

The organization of the institute is much too e- 
laborate to report here.!* For the purpose of this 
presentation, it is enough to say that for the ten 
days at Buffalo, the participants were housed in 
student dormitories and attended the formal ses- 
sions of the institute from 9 a.m. to 5 p.m. andon 
the one Saturday from 9 а. m. to 12 т. Evenings 
and Sunday were free, but most of the participants 
held “рші sessions” on medical education and did 
research and reading relating to their own particu- 
lar teaching problems. 

The major areas covered by the institute dealt 
with 1) evaluation in education, 2) methods of in- 
struction, and 3) the teaching-learning process. 
Less time was devoted to the psycho-socio aspects 
of the medical student, and an historical and philo- 
Sophical view of higher education with emphasis on 
medical education. 

Since one of the objectives of the institute wasto 
bring about changes in the teaching attitudes of the 
participants, in planning, the institute leaders felt 
confident thatlike the first institute, this one would 
also be a success. The first institute was evaluat- 
ей by an independent team who with tape recorders 
and following a previously developed interview 
Schedule, interviewed all the participants. For 
those who would, or could, take time to read the 
results of the interviews, there could be no doubt 
of the changes in the attitudes of the pa rticipate 
On the other hand, the purist wanted statistic?) 
evidence that the attitudes of such a «sophisticated 
group as medical school instructors co uld change 
during an educational program. Therefore, this 
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TABLE I 


RANGE OF POSSIBLE MAXIMUM AND MINIMUM SC 
ORES ON THE 
MEDICAL SCHOOL INSTRUC TOR ATTITUDE INVENTORY 


——————————Є——ЄЄ—Є—Є—Єү——————Є———Є 


Inventory Section Range 

Section I 

Completely Democratic 160 

Completely Autocratic 0 
Section II 

Completely Critical 80 

Completely Complimentary 0 
Section III 

Completely Liberal 96 

Completely traditional 0 
Section IV 

Completely Appreciative 48 

Completely Depreciative 0 
Section V 

Favorable to Full-time 48 

Unfavorable to Full-time 0 
Section VI 

Favorable to Part-time 48 

Unfavorable to Part-time 0 


was the challenge of the second institute-provide 
Such statistical data. 
b Not yet developed for the first summer institute, 
this time there was available a device that could 
measure the attitudes of medical school instructors. 
During the three years prior tothe second institute, 
there was being prepared the Medical School In- 
poror Attitude Inventory (MSIAI) which objective- 
(абаваген attitudes of medical school instructors 
лап Various areas in education. The MSIAI is 
Due into six sections: Section I which measures 
ері; остайс-Ашосгайс Attitude Toward Teaching; 
Me MN П, Critical-Complimentary Attitude Toward 
tiis ae School; Section III, Liberal-Traditional At- 
i ecl Toward Medical Education; Section IV, Ap- 
5 tui eaive-Depreciative Attitude Toward Medical 
fide F S; Section V, Favorable-Unfa vo rable Atti- 
which Oward Full-Time Teachers; and Section VI 
To measures Favorable-Unfavorable Attitude 
Ward Part- Time Teachers. 
Pus Бам itself was standardized on а popu- 
faga 100 medical school instructors and con- 
cM m all the psychometric requirements for 
the En evice. In addition, it was used to measure 
Boia itudes of medical school instructors in a na- 
nal study, and after this use was further refined. 


By the summer of 1959, the MSIAI was supported by 
evidence as being an objective, reliable and valid 
inventory of attitudes of medical school instructors. 
The range of possible scores on this inventory are 
included in Table I. 

To test the hypothesis that an education program 
such as the institute could bring about changes in 
attitudes in medical school instructors, a null hy- 
pothesis was stated: There would be no significant 
change in mean scores on SectionsI and IV of the 
MSIAI for the participants in the Second Summer 
Institute on Medical Teaching. Sections I and IV 
were to be tested by the null hypothesis, for these 
two sections of the inventory were most representa- 
tive of the content of the institute. Moreover, this 
null hypothesis, and as a matter of fact the entire 
study, was based on the assumption that the MSIAI 
was a reliable and valid measure of medical school 
instructor attitudes. There was no reason to be- 
lieve it was not; all the evidence clearly lead to 
this basic assumption. 

On the morning of the first day of the institu te, 
each participant was administered the inventory. 
Upon completing the inventory, the group had nu- 
merous questions concerning the device, but all 
they were told was that it was to provide base-line 


TABLE II 


MEAN SCORES, DIFFERENCE IN MEAN SCORES AND t TEST ON THE MEDICAL SCHOOL INSTRUCTOR 
ATTITUDE INVENTORY OBTAINED BY PARTICIPANTS IN THE 1959 SUMMER INSTITUTE 


шкее——————————————Є—Єї—ү—ү©үуду%—ү—————————-———— 


Section I Section П Section III Section IV Section V Section VI 

Statistic Democratic- Critical- Liberal- Appreciative- Favorable- Favorable- 

Autocratic Complimentary Traditional Depreciative Unfavorable Unfavorable 

Attitude toward Attitude toward Attitude toward Attitude toward Attitude toward Attitude toward 
Teaching Medical Schools Medical Education Medical Students Full-time Teachers Part-time Teachers 
I 

Pre-Test Mean 124 51 73 38 33 23 
Post-Test Mean 131 53 76 40 33 23 
Difference +7 42 +3 +2 0 0 
t 3.50 1.68 2.26 3.44 -- E 
Significance .01 NS .05 .01 NS NS 
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data on its standardization. They seemedto be sat- 
isfied with this answer. During the next nine days, 
only one or two questions regarding the inventory 
were asked, and by and large the group forgot the 
experience. Then оп һе last day, when it was a- 
gain administered, the participants were suddenly 
aware that this was a pre- and post-test situation. 


* It is to the credit of this group that they reacted to 


this realization with equanimity; but with their sci- 
entific background, their willingness to be part of 
an educational experiment could be anticipated. 

Table II is the result of this pre- and post-test- 
ing. For each section of the inventory there are the 
pre- and post-mean scores and the plus or minus 
differences between the means. Seemingly, the re- 
sults are self-explanatory and need only little elab- 
oration, Ы 

The greatest change, obviously, occurred inSec- 
tions I and IV with less change in Section III and no 
significant change in Sections II, V and VI. This is 
a reasonable outcome, for as previously stated, the 
teaching objectives of the institute leaders were 
clearly directed toward the broad topics of teaching 
and student which are encompassed in Sections I and 
IV. Consideration of the subjects, medical educa- 
tion and medical schools, was brief andincidental 
tothe major objective. No consideration was given 
to the issue, part- or full-time faculty, except as it 
occurred in private conversations. However, in ex- 
amining the individual scores onthese last two sec- 
tions, it was interesting to note that individual 
changes in pre- and post-test scores were almost 
evenly distributed— some participants became more 
favorable, others more unfa vo rable, toward each 
other. 

On the basis of this table, for Sections I and IV, 
(and Section III if the .05 level of confidence is ac- 
cepted) the null hypothesis can be rejected and the 
hypothesis can be accepted—attitudes in sucha ““зо- 
phisticated'' group as medical school instructors 
can be changed during the course of an educational 
program.? 

The implications of this study are far-reaching. 
It has long been an objective— and reality —that 
schools develop certain attitudes in their students. 
In most cases, teaching objectives do not specific- 


ally spell out the development of attitudes, but the 
acquisition of these attitudesis more often an acci- 
dental rather than intentional outcome. We have 
come to accept this for elementary and secondary 
education, but are the teachers involved in this ed- 
ucation immune to the devel opment or changing of 
attitudes? Hardly. For some graduate schools take 
time to define objectives that call for the develop- 
ment of attitudes, andteachers are continuously at- 
tending these schools. Yet, whenthe same question 
is asked regarding teachers in a medical school, or 
any professional school for that matter, the answer 
is not as clear-cut if not as often in the negative. 
The common stereotype of a medical school faculty 
has it as a fairly rigid group, confining its teaching 
methods to lecture, laboratory, conference and 
ward round. Individual differences often manifest 
themselves, but by and large the same teaching fol- 
lows year after year. In the chain of ‘‘passing the 
buck", the medical school faculty is the ultimate 
link with an opportunity to place blame on someone 
else for the inadequate preparation of its students, 
More often than not, they look on themselves as de- 
fenders of the last bastion of ‘‘academic discipline." 
In admitting the presence of this attitude, it is no 
wonder that doubts might arise as to the power of 
any educational program to change established atti- 
tudes, or develop new ones insuch agroup. The 
institute at Buffalo, with its clearly defined objec- 
tives, gives testimony to the possibility that no one 
(assuming an open mind) is too old or too rigid to 
change his attitudes when presented with facts. 


FOOTNOTES 


1. For a complete content summary of the institute 
see George E. Miller and Edwin Е. Rosinski, 
“А Summer Institute on Medical Teaching: A 
Report of a Conference," Journal of Medical 
Education, Volume 34, May, ‚рр. -95. 

2. The difference in mean scores cannot be attrib- 
uted to the inventories lack of reliability, for 
reliability coefficients for the various sections 
аге in the .80’s and .90's. One of the real 
attributes of the inventory is its high test-re- 
test reliability coefficients. 
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AN EXAMINATION OF TEN SIGNIFICANT DIFFERENCES 
BETWEEN JUVENILE RECIDIVISTS AND NON-RECIDIVISTS 


Introduction 


Attempting to identify significant differences 
between juvenile recidivists and non-recidivists is 
a difficult task, asresearchers in theareahave 
found. The desirability of such an operation сап 
readily be seen in the potential implications for in- 
stitutional rehabilitative measures, andcurrent pa- 
role practices. Ina current study by the author+ , 
the above problem became the crux of an attempt to 
construct a prediction check list that could be used 
by juvenile training institutions. 


In this study recidivists were defined as 
those parolees from a juvenile training institution 
who were returned to the institution for violation of 
parole, who were recommitted to the state Division 
of Youth Service, or who appeared in Superior Court 
while on parole, and were sentenced to a higher in- 
Stitution. Non-recidivists, conversely, were con- 
Sidered to undergo none of these actions. Those ju- 
veniles who may later have been convicted of 
an offense as anadult, although actually recidivists, 
were not considered as such in this study. 

The population used for thestudy was drawn 
from the total population of Lyman School for Boys, 
Westboro, Massachusetts. This is an “ореп” insti- 
tution for boys from the ages of eleventoseventeen 
operated by the Massachusetts’ Division of Youth 
Service. Two sample groups were drawn at ran- 
dom from the total populationthat was paroled from 
the school from January, 1953 to December, 1956. 
The earlier period of time was chosen to permit 
more of a follow-up on each individualafter parole. 

Both sample groups were divided into recid- 
ivists and non-recidivists according to the definitions 
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mentioned earlier. The first group consisted of 100 
recidivists and 100 non-recidivists selected from the 
1955-56 population. The second group consisted of 
100 recidivists and 100 non-recidivists drawn from 
the 1953-54 population. The only restriction placed 
on the members of the two sample groups was that 
they be in the institution for the first time. In other 
words, those individuals who were already proven 
to be recidivists were eliminated to provide pure 
data for the study. 

Although the ultimate goal of the study - to 
establish a method of prediction - proved to be un- 
obtainable at this time, some progress was made in 
adding to the store of knowledge about recidivism. 
As a result of statistical treatment of the data, ten 
characteristics proved to show significant differ- 
ences between recidivists and non-recidivists at the 
. 01 level of confidence. 

These ten characteristics were the result of 
researchintoanoriginal 163 items representing in- 
vestigation into sevengeneralareas - psychological, 
School, home, family, physical, training school ad- 
justment, and parole. These ten maintained their 
Significance in both sample groups. 


The Ten Characteristics 


For the benefit of those engaged in research 
into this area, each of the ten is worth a brief anal- 
ysis in terms of possible causes, and implications 
for those presently working directly with juvenile 
delinquents. 

1) Height - the non-recidivist tended to be 
taller than the recidivist. Тһе meanheight in inches 
for the non-recidivist was 63, while for the recidi- 
vist, it was 60. This characteristic, as in many 


lpitwack, Lawrence, The Construction and Validation of anInstrumentfor the Prediction of Recidivism Among 


Juvenile Delinquents in Massachusetts. Unpublished Doctoral Dissertation, Boston: Boston University School 


of Education, 1959. 
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of the others, would not necessarily hold true for 
older boys, but it proved to be true for this group 
who were mainly in their early adolescent period. 
The implication here seems to be that boys 
whose physical development is slow, and whocannot 
maintain physical equality with their chronological 
contemporaries, willfrequently attemptto compen- 


. Sate for their lack of stature. This compensation in 


the case of delinquents may take one of several 
forms. Тһе individual may remain with his chron- 
ological peer group, and maintain his status in the 
group by becoming overly aggressive and/or daring, 
to prove that he is as good as his contemporaries. 
Alternately, the individual may move to a younger 
group in which his size is no longer a factor, andin 
which his increased maturity may thrust him into 
the role of the leader of the group. 

2) Age of mother at time of boy's commit- 
ment - As a group, the mothers of the successes 
tended to be older than the mothers of the failures. 
The mean age in months of the mothers of successes 
was 485, while for mothers of failures, the mean 
age was 454. 

Analyzing this factor proves to be more dif- 
ficult. It would seem that the mothers of failures, 
being younger, were more likely to be working, or 
to be engaged in other activities of a social or extra- 
marital nature, Тһе mothers of successes, being 
older, tended to be more inthe home, enabling bet- 
ter supervision of the youngster. The younger 
mothers also tended more towards broken marriages 
and immoral activities. 

3) Age of boy at time of first court appear- 
ance - this factor is actually closely interrelated 
with the next three, and the four can best be dis- 
cussed together. Number four is age of boyattime 
of commitment; five is age of boy at time of arrival 
at training school; and six is age of boy at time of 
parole. In all four characteristics, the older the 
boy, the more likely he is to be a success or non- 
recidivist on parole. 

The implications and possibilities here are 
a little easier to see. Itwould seem that the younger 
the boy, the more effective should be rehabilitative 
measures, since his illegal patterns have notliadan 
opportunity to become deeply ingrained. Apparent- 
ly, however, this does not seem to be the case. 
The reasons for this may vary greatly in the indi- 
vidualcase. One may lie in the better methods of 
detection which bring the individual into court at an 
earlier age. Another may be due to thefactthatthe 
individual in late latency or early adolescence is still 
in a highly formative and impressionable period. 

This raises the question as to how effective 
our rehabilitative measures really are. Some crit- 
ics have charged in the past that present-day train- 
ing schools are merely breeding places for crime. 
Regardless of the merits of this argument (if any ex- 
ist), there is little disagreement over the fact that 
Over-populated, under-staffed, untrained staff, and 
Politically dominated institutions have heavy odds 
against their being really effective in rehabilitating 


delinquent youngsters in their care. 

Still a third possibility here may be found in 
the fact that the younger the boy, the more likely is 
he to be legally required to re-enter school uponhis 
parole from the training school. This point will be 
further elaborated upon іп the discussion of item ten. 

7) School grade completed - The success 
tended to reacha higher level of educational achieve- 
ment than the failure. The vast majority of the suc- 
cess group were in grade seven or higher, while 
the majority of the failures were ingrade six or 
lower, including special or ungraded classes, 

It is well known that delinquents as a group 
have less success in school than dotheir more nor- 
mal non-delinquent contemporaries. It would appear 
that the more frustrating and unrewarding the school 
situation becomes, and the more retarded is the 
youngster educationally, the more likely is he to re- 
bel in anti-social behavior, both in and out of 
school, 

8) Father ever in jail - The fathers of suc- 
cesses tended to be more law abiding and freer of 
court records and jail sentences. The fathers of 
failures tended more towards lengthy court records 
and/or actual imprisonment. One implication here 
is that the boy learns by example, and follows the 
footsteps of his father. A second is that the father 
who is imprisoned is out of the home, thus removing 
the male influence upon tlie boy, and increasing the 
influence and possible domination of the mother fig- 
ure, with all its psychological implications. 

9) Number of times in discipline cottage - 
The success tended never to be placed in the disci - 
pline cottage while in the training school; the failure 
tended to be there at least once. Put another way, 
the training school adjustment for the success tended 
to be much better than for the failure. An individual 
was usually placed in the discipline cottage for run- 
ning away, or for any major infraction of school 
rules. 

The implication here seems to be that the 
boys who continue to rebel against authority even in 
a highly controlled setting suchas is.found ina train- 
ing school, are much more likely to make a poor ad- 
justment on parole, and to end up as a recidivist. 

10) Paroled to school or to work - The suc- 
cess tended to be over the legal school compulsory 
attendance age of 16, and thus was able to leave 
Schooland go to work. Тһе failure, being younger, 
was legally required to re-enter school upon parole, 

As was previously discussed, school is an 
unrewarding experience for the typical delinquent. 
Thus, returning him to this situation seems to reac- 
tivate some of the conflicts that produced anti-social 
behavior in the first place. The implication here 
would seem to be the obvious need for increased pa- 
role supervision for youngsters returning to school, 
and, more important, better understanding and more 
acceptance by the schools themselves forthose 
youngsters returning from a training school. Every 
effort should be made to help them learn and find 
satisfying experiences in school. 
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Summary 


The ten characteristics discussed inthis ar- 
ticle are by no means the only ones that might dif- 
ferentiate between recidivists and non-recidivists. 


However, these were the ones that emerged during 
the course of the study. It is hoped thatthe discus- 
Sion of the items and their possible implications 
may lead to further research and investigation into 
this area. 


Ben c" 
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A Quantitative-qualitative Analysis of Selected Social Science 
Generalizations in Social Studies Textbooks in the 


Intermediate Grades 


The Problem 


EDUCATION, IN its broadest sense, is the 
means of. . . [the] social continuity of lifel*, .. 
Without such formal education, it is not possible 
to transmit all the resources and achievements of 
2 complex society. It also opens a way toakind of 
experience which would not be accessible tothe 
young, if they were left to pick up theirtraining in 
informal association with others, since books and 
symbols of knowledge are [to be] mastered. 2 

Just as it was true when John Dewey wrote 
of it in 1916, education in contemporary society is 
the means of providing the continuity of civilization. 
As in generations past, the textbook continues to be 
the most widely used tool for this necessary goal of 
instruction. 

The object of this study was to evaluate qual- 
itatively and quantitatively the ten social studies 
textbooks most widely used in each of the interme- 
diate grades in ten selected large city schoolsys- 
tems. The textbooks, 30 in all, included 13 geog- 
graphies, ten histories, and seven fused social 
Studies textbooks, used in the intermediate 
grades---the fourth, fifth, and sixth grades. 


Procedure 


With the assistance of an interdisciplinary 
jury of scholars 15 social science generalizations 
Were selected, developed, improved, and estab- 
lished as the criteria for evaluating 30 social stud- 
ies textbooks in the intermediate grades. Size and 
geographic location determined the selection of 
large city school systems which were polled to as- 
certain which social studies textbooks were inmost 


*Footnotes will be found at the end of the article. 
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general use in these systems. This method was 
used in order to get a wide sampling of textbooks in 
use, 

An interdisciplinary jury of scholars, se- 
lected mostly from universities across the United 
States, expressed a high degree of agreement con~ 
cerning the generalizations and approval of them, 
Most of the scholars expressed strong agreement of 
the desirability of the inclusion of the following 15 
generalizations in social studies textbooks. In ad- 
dition to establishing the 15 generalizations, three 
sub-points were also set up for delimiting the gen- 
eralizations; in the interests of brevity these sub- 
points are, however, not included: 


1, The foundations of successful human rela- 
tions are freedom of worship along with jus- 
tice before the law, equality of opportunity 
for self-realization, together with economic 
advancement, and in addition, liberty of 
movement, unhindered inquiry as well as un- 
restricted communication---all of these 
within the limitations of law, national secu- 
rity, and decency. 

2. Civilization is partly dependent upon the con- 
servation and development of human and nat- 
ural resources, 

3. Family living should contribute to a mature 
personality---the expression of the quality 
of emotional, mental, physical and social 
experience, reconstructed and integrated in- 
to his being. 

4. Essential tothe welfareof a dynamic, emer- 
gent democracy is significant participation 
of the individual in community and national 
citizenship. 
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5. The lag іп the development and utilization 
of social inventions as compared with the 
widespread use of technical discoveries is а 
threat to human existence. 

6. The direction of human affairs stems partly 
from the interaction among the individuals 
and groups. 

7. In part a peaceful world depends upon the 
toleration of conflicting ideologies, each 
within its separate sphere. 

8. The danger of devastating wars ---wasteful 
of life, property, and other resources —- 
necessitates an international organization to 
adjust disputes arising out of conflicting in- 
terests. 

9. Interdependence of population and the wise 
use of resources which they share increase 
with the advancement of science. 

10. Growth of population has resulted from uti- 
lizing science to increase longevity as well 
as to control and ameliorate disasters. 
Science and technology have distinctly modi- 
fied man's efficiency to utilizethe resources 
of the earth and the significance of climate. 
12. Since people must live under varied natural 
circumstances, human survival normally de- 
pends upon developing and adapting behavior 
to suit given environments. 
13. Physical and cultural diversities have, in 
part, occasioned great economic, political, 
and social issues of modern times. 
Discoveries and inventions mustbe shared in 
the interests of optimum human well-being. 
15. Technological advancement increases man's 
opportunities for recreation andcreative ex- 
pression. 


11 


14 


The technique of appraisal was that of the 
word count rather than that of counting lines or pages 
of materials devoted to a certain generalization. 
Certain rules were set up as guides. Inthe first 
place, each word was usually considered to be a unit; 
a compound noun like New York City or Christopher 
Columbus was counted as one word inthe evaluation. 
Captions under a picture or map were not counted in 
the word count of the total number of words pertain- 
ing to any of the generalizations. Paragraph head- 
ings, likewise, were not counted for this purpose. 
The title page, foreword, table of contents, and in- 
dex were excluded from the evaluation. All materi- 
als in the textbooks were divided into two basic cat- 
egories, textual and non-textual. Only verbal mate- 
rials, designed to impart information to the reader, 

was classified as textual, Non-textual material, on 
the other hand,. included all other categories, such 
as pictures, maps, graphs, charts, tables, activi- 
ties, and tests. Sampling some areas of the textbooks 
as a basis for evaluating them was not the method 
used in this study; every page and every word have 
been considered. 

A definite procedure was followed in respect 
to each of the 30 textbooks. А preliminary summary 


was done on each textbook. The data included an 
over-all view of a textbook, in respect to the total 
number of words of textual material, word displace- 
ment of non-textual materials, the number of words 
per average page, and the average number of words 
per square inch or word displacement scale. These 
data were recorded ona Separate card for each book, 
This information was necessary for an evaluationof 
the textbooks, 

The second stage in this research consisted 
in reading carefully each textbook and recording the 
number of words of textual matter or square inches 
of non-textual matter (later converted into words 
displaced). Two types of work sheets were used for 
recording the data. These work sheets provided a 
way of recording both quantitativeand qualitative 
data. Profiting from the review of the literature on 
textbook research, converting square inches of ma- 
terial into words displaced was in the judgment of 
the analyst the most precise way of comparing tex- 
tual and non-textual data as well as books withprint 
of different sizes. Qualitatively, references to the 
15 generalizations were recorded inwords or Square 
inches as excellent, good, fair, poor, or negative, 
Each of these terms expressing a quality was further 
defined. Excellent meant admirable or to the high- 
est degree. Good signified satisfactory, fair sug- 
gested average, poor expressed unsatisfactory, and 
negative designated opposite. 

Recording the references to the generaliza- 
tions was precisely done. Work sheet I provided 
Space for recording wordage of textual material and 
Square inches of non-textual material for each of the 
15 generalizations under the qualities of excellent, 
good, fair, poor, and negative. Work sheet II was 
used when a second sheet was necessary and for 
Summarizing the data. Writing the page and the 
number of words, for example, 113-21W, under one 
of the five qualities left a record of the quality and 
the quantity in the sum of 21 words on page 113. 
Non-textual materials were marked under one of the 
five qualities with the page of the reference preced- 
ing the quantity, ог 197-22''x 5'', Later the square 
inches were computed and converted into words dis- 
placed. In making a summary of textual materials, 
the total number of words referring to a generaliza- 
tion was recorded under each quality. Non-textual 
materials were recorded inwords displaced by com- 
puting the space in square inches and multiplying by 
the word displacement scale. Horizontal totals pro- 
vided the number of words, regardless of the quality, 
which referred to a particular generalization. Ver- 
tical totals furnished the number of words in any one 
of the five qualities without reference to a specific 
generalization, Atthe same time, the analyst hada 
record of the number of words of each generalization, 
placed under the proper quality, as expressed in both 
textual and non-textual materials, 

In order to perfect the method of analysis, 
the analyst selected one book and analyzed it twice. 
A time lapse of approximately six months between the 
two analyses eliminated memory as a factor. 
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Although the two analyses did notagree inall 
respects, the examiner considered this procedure 
valuable. In bothtextual and non-textual materials, 
more ratings were recorded as poor or negative in 
the second analysis. Also the total number of words 
of references to the generalizations was smaller in 
the second than in the first analysis. The value of 

' the second analysis was to make the examiner more 
critical in rating references under one ofthe five 
qualities and cause her tolook for a clearer expres- 
sion of each generalization. The investigator felt 
such a procedure would insure a more even and ac- 
curate analysis of the remaining 29 textbooks. 


Findings 


1. The quantity and quality of the attention to 
15 generalizations was very uneveninthe 30 
textbooks. А considerable quantity of word- 
age was devoted to generalization No.11, 
whereas almost no wordage comparable to 
No. 13 was found in the textbooks. 

2. Definitely a trend toward more non-textual 
wordage was established; non-textual word- 
age accounted for the large number of words 
expressing No. ll. Muchofthis wordage was 
expressed in quite large pictures. Thequal- 
ity of wordage inclined more toward excel- 
lent and good than toward negative and poor, 
with the exception of No. 11 which inclined 
more toward negative. Тһе quality of non- 
textual wordage was more than three times 
as much as textual wordage. Тһе generali- 
Zations found in smallest quantity---Nos. 5, 
7, 10, and 13- —were present in textual word- 
age but not іп non-textual. 

3. Some generalizations showed a quantitative 
trend in the textbooks onthe three grade lev- 
els. In three cases out of four the general- 
izations present in the least wordage---Nos. 
5, 7, 10, and 13---appeared in the sixth- 
grade textbooks. The generalizations pres- 
ent in largest quantity---Nos. 1, 2, 9, 11, 
and 12---were present in largest wordage in 
proportionto the total words in the books on 
the fourth-grade level. 

4. Quantitatively and qualitatively no pro- 
nounced trends appeared respecting fus e d, 


geography, and history textbooks; the one ex- 
ception was that the four low generaliza- 
tions---Nos. 5, 7, 10, and 13---were en- 
tirely absent in history books. 
5. No. 11 was accorded the most words and No. 
13, the fewest. Almost three times as much 
wordage was judged to convey the meaning of 
No. 11 as the generalization nearest to it in 
quantity, namely, No. 12. No. 13 had less 
than one-fourth as much wordage as No. 10, 
the second lowest generalization. 
Articulation of textual and non-textual mat- 
ter was uneven. There were many instances 
in which non-textual matter was not closely 


related to textual wordage; this partly ac- 
counted for the large quantity of non-textual 
word displacement in contrast to textual. 
Generally speaking, the quantity of the gen- 
eralizations as expressed in most of th e 30 
textbooks was judged to be inadequate, when 
compared with the Canadian-United States 
textbook study. In textbook research,the 
Canadian- United States project is considered 
a high type piece of research. Three books 
which devoted from 11 per cent to 14 per cent 
of space to the substance of these generali- 
zations were consideredadequate, whencom- 
pared with the Cariadian-United States text- 
book study. Some portion of the wordage 
stating the equivalent of the 15 generaliza- 
tions was stated in each of the five qualities, 
but wordage in no quality was emphasized 
overwhelmingly at the expense of the others. 
In view of the above statements 27 books 
contained the equivalent of the 15 generali- 
zations in unsatisfactory quantity. 


6 
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FOOTNOTES 


1. Dewey, John. Democracy and Education. New 
York: The Macmillan Company, 1916. 418 pp. 


2. Dewey, John. Democracy and Education. New 
York: The Macmillan Company, 1916. p. 9. 
3. Hart, Albert Bushnell. SchoolBooks andInter- 
national Prejudices. International Conciliation 
pamphlet] No. 38. New York: American Asso- 
ciation for International Conciliation, 1911. 
13 pp. 
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The Strong Vocational Interest Blank and 


Occupational Proficiency ' 


VOCATIONAL INTEREST authorities have hypo- 
thesized that a definite relationship can be expected 
between interests and proficiency in a given occu- 
pation. Only а few such studies have been attempt- 
ed. Either an insufficient number of cases, an in- 
adequate criterion of proficiency, or both have re- 
sulted in failure (2. The usual technique is to 
study one occupation. Superior and inferior sub- 
groups of the occupation are defined on the basis of 
criterion performance. Anintraoccupational inter- 
est scale is then constructed on the basis of signif- 
icant response differences for the two subgroups. 


The Problem 


For a proposed study, three hypotheses have 
been suggested: a) Occupational members can be 
classified with respect to quality of performance; 
b) A superior-inferior interest scale can be con- 
Structed for the occupation; and c) the superior-in- 
ferior interest scale can be validated on primary 
and control groups. 

The specific questions to be answered by this 
preliminary study are: 

1. Is there a substantial relationship between 
vocational interests and occupational proficiency? 

2. Is there а reasonable probability of establish- 
ing the stipulated hypotheses? 


Procedures 


Liaison was established with Federal Civil Ser- 
vice examiners in metropolitan areas. The author 
made direct contact with examinees who appeared 
for scheduled stenographic examinations. Applica- 
cants who met the following basic qualifications 
were sought for the study: female between the 
ages of 18 and 22; with little or no stenographic ex- 
perience; with approximately two years’ training 
in a stenographic curriculum; and employing phon- 
ography rather than phonotypy (machine shorthand). 

A total of 247 subjects qualified for and were 
used in the preliminary study. Each applicant was 
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furnished two self-addressed, stam p ed envelopes 
for the convenience of mailing to the investigator 
а) the completed interest inventory and b) thesub- 
sequently reported performance test score. 

The criterion was cons idered shorthand/trans- 
Scription performance, as required by theCivil 
Service test. Such performance constituted the test 
of proficiency. The performance score was based 
upon an objective scoring of the transcript by the 
Civil Service Commission. The performance test 
employed was one especially designed by the Com- 
mission--an article, dictated for three minutes, at 
а speed of 96 words a minute, requiring transcrip- 
tion by means of a typewriter. The Commission 
reported an interform overall reliability of from .93 
to.96 forthe test. Validity was evidenced by the 
nature of the performance task. 

The subjects who passed the proficiency test, 
scoring at least 70 percent, were considered ‘‘su- 
регіог,”” and those scoring below this divisional 
score were labeled ‘‘inferior. ”’ 

The data were subjected to the chi square (x^) 
technique. The degree of response divergence with 
respect to the two dichotomies was sought. The in- 
formation was treated in a 2 x 2table in which “L” 
represented one category and “1” and “D” were 
combined. ‘‘L’’ for the **superior" members of the 
occupation constituted the ref er ence point for the 
Scale. Anunweighted unit-scoring method was ap- 
plied. Each item which favored lik ing by ‘‘super- 
ior" members was scored “41”; others were scored 
'**0." For items to be included in the differentiat- 
ing scale, the level of confidence was arbitrarily 
Set between one and two percent. 


Findings and Discussion 
ылымы ақыны ысыды 


1. А superior-inferior interest scale was con- 
Structed on the basis of 94 ite ms, or 23. 5 percent 
of the items included in the Strong Vocational In- 
terest Blank for Women. The scale items were 
based on P< .02, a X* value of at least 5. 412 (see 
Table I). 


* This is a report of a preliminary, or pilot, study conducted in 1952, as the basis for the author's Ph. D. dis- 


sertation . 
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TABLE I 


DISTRIBUTION OF LEVELS OF SIGNIFICANCE OF THE 400 ITEMS FROM THE 
SVIB (W), COMPUTED BY X^ (2x2 TABLE, 1 df, N = 247) 


= o 


x Cumulative 
Item Frequency ZELUS T ЕТІЛЕ АТ S CO Item Frequency Percentage 
249 > .05 .0158 - 3.841 400 100. 00 
5" .05 -.02 3.841 - 5.412 151 37.75 
51 .02 -.01 5.412 - 6.635 94* 23. 50* 
43 ^ S .01 6.635 - 23.214 43 10.75 


Sn I Ue ыны 00 OAS 


* Basis on which the preliminary superior-inferior interest scale was constructed. 


TABLE II 


DISTRIBUTION OF PRODUC T-MOMENT CORRELA TION COEFFICIENTS BETWEEN SCORES 
ON THE SVIB (W) AND SCORES ON THE PROFICIENCY TEST, BASED ON SIGNIFIC ANCE 
LEVELS OF DIFFERENCES IN RESPONSES TO THE INTEREST ITEMS 


к=——=———————————————————к——ЖС—Ж—Ж——=—————Е—Е—Е—Е———————————————————— 


Frequency 
Non-cumulative Cumulative P r 
249 400 > 05 . 087 
57 151 05 - 02 ‚411 
51 94 02 - .01 835 
43 43 x 01 ‚812 
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2. The mean proficiency percentage score for 
the 247 subjects was 74.9, withastandard deviation 
of 1.5; the mean raw interest scale score was 60.2, 
with a standard deviation of 2. 7. 


3. The proficiency and interest scores gavea 
Pearsonian product-moment correlation coefficient 
of .835+ . 004 (See Table II). 


Intrinsic and extrinsic co rrelates of vocational 
proficiency were not controlled. There is some 
question as to whether 70 percent is a suitablecrit- 
ical score. The method of combining “I” and “D” 
responses for comparison with ‘І’ responses is 
questionable in the light of better scoring techniques 
available. The arbitrary setting of a significance 
level may be in line for more definitive reasoning. 
The relationship between the two variables, without 
control of related variables, does not permit accu- 
rate correlation coefficients. Inaneffort to resolve 
these possible criticisms, however, reference is 
made to the purpose of this preliminarystudy. Only 
indication of the two-variable (interest and profic- 
iency) relationship was sought. Anade quate basis 
is furnished for proceeding with the proposed major 
study. 


Summary and Conclusions 


A preliminary study was designed to determ i ne 
the indication of a relationship bet ween vocational 
interests and occupational proficiency. The sub- 
jects were 247 Federal Civil Service applicants for 
Stenographic positions. The criterion of proficiency 
was defined as shorthand recording andtranscribing 
Skill. The criterion was measured by means of a 
stenographic work sample. Interests were measur- 
ed by means of the SVIB (W). The technique involv- 
ed the construction of a superior-inferior interest 
scale for shorthand writers. Significant response 
differentiation was employed. Interest inventories 
were scored on the basis of the scale. Pearsonian 
product-moment correlations were computed by use 
of scores on the two variables. 

Pursuant to the design of the study, the following 
conclusions appear to be warranted: 

1. There is a substantial relationship between 
vocational interests and occupational proficiency. 

2. There is a better-than-chance probability 
that the stipulated hypotheses of the proposed major 
study can be established. 


10. 


11. 
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EDITORIAL 


TEACHING MACHINES..... 
Shall We Test Them? 


It has been many times observed that if education is to keep pace with a 
rapidly changing and expanding world, there must be major break-throughs in 
school education. A revolution in school education is, in fact, now in prog- 
ress. One needs onlyto read almost anywhere to catch the flavor ofthetimes. 
Within this expanding educational universe is the move to provide teachers 
with better equipment, and with new learning and teaching aids. One of the 
very important new tools to be made availableto teachers is the teaching ma- 
chine. It is merely one of many new mechanical aids to learning. 


In keeping with the tradition of The Journal of Educational Research, we 
welcome reports of carefully controlled experimental studies of teaching ma- 
chines. We believe that this research might well be of three sorts: 


e 1) Exploratory studies. The purposes served by such studies are many, 
but there is need, first of all, for research that will make clear the needs 
served by each machine, the types of learning with which it is helpful, and the 
conditions under which it has produced desirable pupil growth and achievement, 
In offering this comment, it is assumed that machines will not be all things to 
all men, but have value with reference to particular specified types of pupils, 
purposes, and conditions that control their utilization. 


e. 2) Cost studies. Assuming that teaching machines will make an impor- 
tant addition to the tool kits of teachers, one might correctly inquire at what 
cost. The reference here is not merely to monitory matters. What will be 
the gains and losses accompanying the use of teaching machines? Most inno- 
vations involve both losses and gains. What will be the cost in terms of time, 
money, energy, and values that we consider important? 


e 3) Comparative studies. Cost studies and exploratory studies need to 
precede comparative studies, or become avital part of them. It does not seem 
to make much sense to com pare one machine with another machine or one 
means with another means until the needs served by each machine or means 


and the costs have been specified. 


The Journalof Educational Research, over the y ears, has carried re- 
ports of many researches that have pertained to the ‘‘how-to-do-it”’ aspects 
of school education. Sometimes the question investigated in these researches 
have not been well formulated. Frequently, the research seems to ask ques- 
tions similar to that which one might ask a carpenter, for example: Which is 
better, a screw-driver or a hammer? Anyway, we believe teaching machines 
offer a splendidopportunity for some important researchin the realm of school 
education. The Journal of Educational Research would be pleased to receive 


reports of systematic studies of this new and important innovation. 
A. S. Barr 


f October 1961 
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EDUCATION TELEVISION GUIDE BOOK, Philip 
Lewis. New York: McGraw-Hill Book Com- 
pany, 1961. xii - 238 pp. 


This is a methodical presentation of the many 
facets of educational television. It begins with basic 
elements and proceeds to explain their applications 
and inter-relationships. It will serve as a ready 
guide in developing the effective team work which 
educational telecasting requires. 

At the outset the author accepts the premise 
that television can teach effectively, and proceeds 
to delineate the methods of using it. Experiences 
based on actual case studies are reviewed to illus- 
trate the handling of specific problems related to the 
art of effective presentation. 

As may be expected from an author who has 
had long experience in the audio-visual field, and 
from a work produced in cooperation with the Elec- 
tronics Industries Association, this volume embodies 
what is virtually a show case of television-related 
equipment and its uses. It is generously illustrated 
with pictures, diagrams, andcharts. The television 
medium is put into perspective with other devices. 

Station operation and program planning are 
touched upon rather lightly. Administrative, pro- 
gram, production, and engineering job responsibil- 
ities for both closed circuit and on-the-air Opera- 
tions are described. Actual situations are cited to 
illustrate approved practices. 

Equipment needs for on-the-air and closed- 
circuit operations are explained in detail, as are 
those of kinescope and video tape recording activi- 
ties. Translators, networking, multi-channel broad- 
casting and microwave relays receive attention as 
they relate to particular situations or problems. 

Program origination and control equipment are 
described in complete detail. Camera types and 
characteristics, lighting, monitors, microphones, 
Switcher-faders, and other related components are 
all included. 

Checklists of equipment needs, together with 
prices of items required for various kinds of instal- 
lations are provided for guidance in determining 
costs. These are related to actual cases, and are 
described in considerable detail. 

As a textbook for students this volume pre- 
вепів a wealth of pertinent information and ideas. 


Book Reviews 


For reference, it embodies a glossary of terms, a 
bibliography of publications, reports and periodicals, 
and a list of organizations associated with educa - 
tional television. 

School board members, administrators, and 
others concerned with opportunities and alternatives 
related to the establishment of educational stations 
wil find this book guiding them from the A-B-C's 
through the XYZ's of this challenging new medium. 

Harold A. Engel 
University of Wisconsin 


TEACHING FORSELF-EDUCA TION: Asa Life Goal, 
Paul Douglass. New York: Harper and Broth- 
ers, 1960. 141 pp. 


Paul Douglass, a professor of government at 
Rollins College and head of the Center of Practical 
Politics has performed a service to the field of adult 
education by writing this definitive examination of 
the thought and writings of William S. Learned, a 
brilliant member of the staff of the Carnegie Found- 
ation for the Advancement of Teaching for over thir- 
ty years. Learned's professional interest c entered 
on a critical reappraisal of the learning process, 
particularly the weaknesses that appear at the college 
level. 

Shortly after Learned began work at Carnegie, 
The Divisionof Education at Harvard University in- 
augurated a series of studies with the publication of 
his history of the German schoolmaster. Learned 
stressed the needfor professionalism among teach- 
ers. His vision was '*of a time when the teacher who 
Shapes our careers shall be even more rigorously 
selected, more amply and purposefully trained, and 
more highly responsiblefor his performance than he 
who mends our bodies or untangles our personal re- 
lations. ” Learned believed that Plato's provision 
that the head of the state should be the director of 
education expressed the ultimate in a ‘‘completed 
democracy.’ He extolled the state of Vermont for 
compensating the commissioner of education 
more highly than any other state official including 
the governor. 

Learned lamented the credit hunting of our college 
and university students, andsetas the objective for 
the college student the acceptance of responsibility 
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for self-education. He developed an increasing con- 
viction that a student's intellectual momentum can be 
generated by the cultivation of four principles: 1) sin- 
cerity anddirectnessofaim, 2) thorough discovery, 
3) cooperation towardthe attainment of an individual 
and self-sought goal; and 4) honest objective apprais- 
. al of results. With sound advice for educators of 
adults Ben Wood recalled that Learned held the con- 
viction thatteachers should spend half of their time 
ascertaining the abilities, interests, and needs of 
their students as individuals, and the rest of their 
time helping students meet their needs, exploit their 
abilities, and developtheir interest to the optimum. 

Learned was concerned about what he called 
the ‘‘Ritual of the Mark". His criticism was that 
under the present system the **sole question is 
whether by hook or crook one can pass the arbitrary 
deadline with a collection of tokens thatare like- 
wise arbitrary.'' Hequestioned the whole procedure 
of segmentation and discontinuity of the individual's 
knowledge, sotypical of our extreme departmental- 
ism. 

Learned wanted tests to examine the whole 
range of knowledge regardless of the students pro- 
gram. Books andcourses should turn on lights. The 
importantthingistheactualthinking the student does 
for himself about whathesees. This the tests should 
measure andshould have no necessary relation to 
Courses. Тһе basis for intellectual morale is to be 
found in the grip that sequences of important, educa- 
tive ideas have on the mind. The prerequisites for 
its development are time, freedom, and encourage- 
ment to continuous thought. Most of the above he saw 
as conspicuously lacking in colleges as they are pre- 
Sently organized. 

Learned will perhaps be best remembered for 
his attempt at a profile of the permanent and avail- 
able equipment of a student through the Graduate Re- 
cord Examination. Thiswas conceived in Learned’s 
mind as a pilot project, andthenconsumed his ma- 
jor energies during hismature years. Though some 
may remember Learned as a cold, meticulous test 
technician, Douglass believes nothing could be fur- 
ther from the truth. Learned was fundamentally a 
philosopher. For him, the Graduate Record Exam- 
ination was merely one tool designed to give a more 
exacting intellectual inventory. He felt one of the 
weakest spots in higher education is our admission 
policies. Learned provided the early and strongest 
stimulus toward the growth of objective testing pro- 
grams in higher education. 

One of the valuable parts of the book is the 
bibliography listing the major writings of Learned 
including his classic on The American Public Li- 
brary and the Diffusion of Knowledge. The book 
should prove useful for those interested in cultivat- 
ing continuous learning as a way of life. 

As Learned pointed out, this must begin in our 
Colleges and universities, where too often curios ity 
is stifled instead of cultivated and nurtured. 

Roger W. Axford 
University of Wisconsin-Milwaukee 


THE CREATIVE ARTSIN AMERICAN EDUCATION, 
Thomas Munro and Herbert Read. Cambridge: 
Harvard University Press, 1960. 65 pp. 


In two insightful and concise essays developed 
for HarvardastheInglis and Burton lectures, Mun- 
ro and Read attempt to analyze the place and value 
of the arts in today's society. Munro, in Тһе Inter- 
relation of the Arts in Secondary Education,speaks 
from his background of experienceas a teacher about 
the current neglect of the arts because of the pres- 
sure placed upon students to achieve vocational goals. 
According to Munro, thearts lend themselves to in- 
terrelating into the curriculum of the adolescent in 
such a way as to enhance the adolescent as a healthy 
organism able to contribute to the ongoingness of 
civilization. 

Read, in The Third Realm of Education, is 
concerned with the alienation inour society resulting 
from the dehumanizing effect of the machinery nec- 
essary for mass production. Read sees the arts as 
a means of understanding the principle of play, of 
giving man the opportunity to express the freedom 
that is his. 

Thesetwo essays should appeal to those con- 
cerned with artin today’s curriculum and to a wider 
audience concerned with man as a creative being. 

Louise M. Berman 
University of Wisconsin-Milwaukee 
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| It Says in the JER... 


WHAT EFFECT DOES THE SIZE OF THE SCHOOL HAVE ON PROGRAMS for the gift- 
ed child? Durr surveyed 316 Michigan elementary schools and found: 1) Larger 
Schools from larger systems are more likely to have planned programs for the 
gifted. 2) Small schools use partial segregation more often than large schools, 
while large schools use extra-curricular activities more often than small schools. 
3) Although a wide variety of identification procedures areused, schools inlarge 
systems use group intelligence tests to a greater extent than do schools in small 
systems. 4) Schools in small systems are more likely to feel hindered by inad- 
equate finances in establishing programs, while schoolsinlarger systems are 

. more likely to say they do not need a planned program or thatthey have too few 
gifted to warrant it. Increasing interest in the gifted will probably lead to more 
planned programs, Durr believes, particularly as the trend toward larger 
Systems accelerates. 


IF YOU HITCH YOUR WAGON TO A STAR, WILL YOU EARN BETTER GRADES? Not 
necessarily, Red, McCary, and Johnson report. Thethree Houston researchers 
Studied 101 freshmen and sophomores to determine the degree of relationship 
between some measures of aspiration and actual academic achievement. They 
found little connection. The optimists and pessimists did not differ substantially | 
in their GPA’s. On the basis of their findings, the authorsdo not recommend 
the inclusion of a ‘‘level of aspiration" test in a battery designed to predict 
academic achievement. 


HOW AVAILABLE IS SPECIAL EQUIPMENT FOR EDUCATING THE GIFTED? 
Applbaum surveyed outstanding U.S. high schools, and found that while only 12. 3 
percent of the principals feel there is no need for providing special equipment 
for use in teaching the gifted, only seven percent indicate they have such equip- 
ment available for extensive use. 


WHAT IS THE RELATIONSHIP BETWEEN TIMED AND UNTIMED SC ORES IN EDU- 
CATIONAL TESTING? Boag and Neild find at Hornell, New York, that some 
high-school students make marked changes in their relative standings when they 


are given additional time to work on a test. Therefore, they say, speed and 

power scores cannot be used interchangeably. The average student, that is, the 

slow and accurate student, comes out nearer the top when given plenty of time 
\ but suffers when there is a time limit. Changes in relative standings occur 

with considerably greater frequency throughthe middle range of scores than they 
| do at either extreme. 


HOW DO YOU “PULL OFF" A MAJOR JOB OF CURRICULUM REVISION? Specifi- 
cally, how do you build an instructional program that will help students apply 
principles from general education to professional education? In a novel case 
study at the University of Washington, Johnson describes the experience of two 
college instructors trying to become more skillful in helping student nurses 
learn to apply concepts from sociology and psychology in clinical work. His gen- 
eralizations include: 1) A sound curriculum rationale is basic to cooperative 
curriculum reform, but the human element must be carefully handled if prog- 
ress is to be made. 2) Instructors can effectively direct their attention to sig- 
nificant curriculum tasks ifthe problem areasinwhichthey work are ones about 
which they are sufficiently concerned to want to take some action. 3)If teachers 

) are expected to use the products of curriculum research, they must be involved 

in the whole process of arriving at the results. 4) The most potent obstacle 

may well be the instructors' lack of knowledge of the subject matter. 


| IS SOME STUDENT COUNSELING SOMETIMES WORSE THAN NO COUNSELING? 
Definitely yes, report Winborn and Schmidt after investigating the effectiveness 
of short-term group counseling on the academic progress of underachieving 
but potentially superior college students. The work was done on 135 freshmen 
in the academic year 1958-59 at Indiana University. Results: The students who 
did not participate in a short-term group counseling program made significantly 
higher GPA's than did the participants inthe program. ‘‘This indicates," say 
the researchers, ‘‘that short-term group counseling can produce a negative ef- 
fect on the academic achievement of potentially superior freshmen," Another 
finding: The majority of freshmen who were designated potentially superior but 
underachieving students could not be identified as such onthe basis of informa- 
tion available to university officials prior to receipt of first-semester grades. 


SS ы 


| ARE PERSONALITY FACTORS RELATED TO COLLEGE SUCCESS? Very definitely, 
says Centi. In his study he found that highest ranking students at Fordham Uni- 
versity tend to be better adjusted than are lowest ranking students. He points 
out, however, that the relationship between emotional adjustment and level of 


achievement is probably not a direct one. 


DO “РОР QUIZZES" HELP STUDENTS LEARN? Not American government at West 
Texas State College. Selakovich reports no significant difference in achieve- 

| ment between students who take frequent tests and students who do not. The 

\ (Continued on page 168) 
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Provisions for the Gifted in Relation to School Size 
and System Size at the Elementary Level 


SURVEYS OF PROVISIONS for the gifted are us- 
ually quantitative reports of responses to question- 
naires or descriptive reports of program in select- 
edcities. Severalhavereported the results of 
questionnaires to school systems within selected 
States. A survey in Ohio showed that 11% of the 
School systems were making provisions for the gift- 
ed (4). A majority of the school systems in Wiscon- 
Sin reported provisions being made for the gifted 
through some form of grouping or counseling and 
guidance (3). Half of the elementary schools in 
Michigan reported special provisions for gifted 
children (1). 

Surveys of widely selected schools throughout our 
country have shown that a majority of schools were 
making some provisions for the gifted (6, 7). А 
number of descriptive reports of programs in se- 
lected cities have also been published (2, 5). They 
describe the ways that selected school systems try 
to meet the educational challenge of their most able 
Students. 


Purpose and Population 


This study is also based on questionnaire re- 
Sponses from a selected population; however, it ex- 
plores relationships whichhave not been examined 
in previous reports. It explores the relationships 
between the size of school systems (number of ele- 
mentary schools), the size of elem entary schools 
(number of Students), and provisions madeor pro- 
posed for the gifted. 

Specifically, it explores the relationships be- 
tween the two variables of system size and school 
Size and 1) special provisions for the gifted, 2) the 
kinds of provisions and methods of identification for 
those which do have special programs, 3) the rea- 
Sons and plans for those which do not make special 
Provisions, and 4) the kinds of provisions schools 
Would like to make if there werenolimiting factors. 

These relationships were determined for a ran- 
domly selected group of elementary schools which 
Were representative of the system size and school 
Size for all elementary schools in Michigan. 


WILLIAM K. DURR 
Michigan State University 


Procedure 


Selection of sample: A total of 1518 elementary 
schools in Michigan were chartedby school size and 
system size. One-third of these schools were then 
selected. Although the original tabulation was con- 
trolled to insure a representative sam pli ng of the 
two variables, the final selection of one school out 
of every three was made randomly within each cate- 
gory. This selective process resulted in 506 schools 
being chosen to receive a questionnaire. 

Questionnaire: A four-item questionnaire was 
sent to each of these 506 schools. Item one asked if 
the school had a planned program for mentally su- 
perior children. Item two was to be answered only 
by those schools which indicated that th ey did have 
such a program. It consisted of two sub-checklists: 
one concerning the type of program and another con- 
cerning methods of identification. Examples were 
given for each item when it was necessary for a 
common understanding by respondents. 

Item three was answered only by those schools 
which did not have a planned program for the gifted. 
It asked for the reason that no program was in op- 
eration and whether the school planned to initiate a 
program for the mentally superior. 

The last item on the questionnaire was answered 
by all those responding. It was achecklist asking 
for the kinds of programs the responders would fa- 
vor if there were no limiting factors such as finan- 
ces or space. 

Responders were not required to sign their 
names. They could do so, however, if they wanted 
а copy of the results. Each questionnaire was coded 
to indicate school size and system size if it was not 
signed. 


Analysis and Interpretation of Data 


The significance of differences in the answers 
was determined by chi-square. Chi-squares were 
not computed if the theoreticalfrequency of any cell 
was less than five. The five percent level of confi- 
dence was selected as an acceptable level to denote 
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significance. 

Five hundred and six questionnaires were mailed 
and 316 (62%) were returned. These returns were 
checked by system size and school size against the 
original sampling to be sure that they were repre- 
sentative of all the elementary schools in Michigan 
on these two variables. The obtained chi-squares 
(1.83 and 1.54) were not significant which indicated 
that the returns did not differ significantly from the 
original sample in the two variables. 

At the time of this study, the median system size 
in Michigan was five elementary schools. Systems 
with from one to five elementary schools were clas- 
sified as small and systems withfrom sixto 185 el- 
ementary schools were classified as large. The 
median school enrollment for all elementary schools 
in Michigan was 444 students. Those schools with 
from 32 to 444 students were classified as small 
and schools with from 445 to 1981 students were 
classified as large. 

In all tables, the observed frequencies and the 
theoretical frequencies are shown. The theoretical 
frequencies are given in parentheses. 

In response to ће дчевііоп “Бо you have a 
planned program for mentally superior children in 
your school?"', 155 (49%) indicated that they did and 
161 (51%) indicated that they did not. 

In Table I, the responses to this item are shown 
by system size and school size. In both cases, the 
differences were significant. Schools in s ystems 
with six to 185 elementary schools were more like- 
ly to have planned programs for the mentall y su- 
perior than schools in systems with one to five ele- 
mentary schools. Schools with 445to 1981 students 
were more likely to have planned programs for the 
mentally superior than schools with 32 to 444 stu- 
dents. 

The schools which did have planned programs for 
the gifted were then asked to indicate the kinds of 
provisions they made. They checkedoneor more of 
the following methods of p roviding for the gifted: 
complete segregation (special classes all day), par- 
tial segregation (special classes part of thetime), 
acceleration (e.g. moving children ahead by ‘‘skip- 
ping a grade"), enrichment (provisions made with- 
in a regular classroom), or extra-curricular acti- 
vities (e.g. science club for selected childre n). 
The observed and theoretical frequencies by school 
Size and system size are shown in Table II. 

The observed frequencies gain added m e aning 
when they are compared with the observed frequen- 
cies in Table I. There it was shown that planned 
programs were provided by 49 schools in systems 
with one to five elementary schools and 106 schools 
in systems with six to 185 elementary schools. Al- 
So, 60 schools which had from 32 to 444 students 
and 95 schools which had from 445 to 1981 students 
stated that they had planned programs. 

The first part of Table II shows no significant 
differences between large systems and sm all Sys- 
tens in the kinds of provisions that were made for 


gifted. 

The second part of Table II shows significant dif- 
ferences between large schools and small schools 
in the kinds of provisions that were made for the 
gifted. Small schools were significantly morelike- 
ly to use extra-curricular activities. Although en- 
richment in the regular classroom was by far the 
most popular method of providing for the gifted, 
there were no significant differences between large 
systems and small systems or between large schools 
and small schools in its use. 

Chi-squares were not determined forthose 
checking complete segregation because the theoret- 
ical frequencies in one or more cells was below five. 
However, it is noteworthy that eight of the large 
Schools and only one of the small schools indicated 
that they used complete segregation. 

The schools which did have planned provisions 
for the gifted were then asked to check one or more 
of the identifying methods shown in Table III. 

Teacher judgment, achievement tests, and group 
intelligence tests were the most popular methods of 
identification, in that order. However, in only one 
instance was there a significant difference between 
small systems and large systems in the methods 
used. Large systems used group intelligence tests 
to a significantly greater degreethan did small sys- 
tems. 

There were no significant differences inthe iden- 
tification methods used between large schools and 
small schools. 

The 161 schools which did not have planned pro- 
visions for the mentally superior were asked to in- 
dicate the reasons. The results of this item are 
Shown in Table IV. 

Again, a better understanding of thistable can be 
achieved by referring to the ob ser ved frequencies 
in Table I. There it was shown that 88 schools in 
Systems with one to five elementary schools and 73 
Schools in systems with six to 185 elementary 
Schools stated that they did not have planned pro- 
grams. Also, 87 schools which had from 32 to 444 
Students and 74 schools which had from 445 to 1981 
students stated that they did not have planned pro- 
grams. 

A total of 20 schools did not clearly state a rea- 
Son. These were distributed by system size and 
School size so that they did not affect thesignifi- 
cance of any differences. 

Most of the respondents listed more than one 
reason. Table IV shows those which were most fre- 
quently listed alone or in conjunction with others. 
Lack of adequate finances and lack of physical fa- 
cilities were most often listed. 

There were three reasons for no planned pro- 
grams which differed significantly by sy stem size. 
The small systems listed lack of adequate finances 
significantly more often than did the large systems. 
On the other hand, no planned program needed, and 
not enough gifted children, were listed more fre- 
quently by the large systems than by the small sys- 
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TABLEI 


PLANNED PROGRAMS FOR THE GIFTED 


ЕЕЕ ЕЕЕЕЕЕЕЕЕ 


By number of elementary schools in the system 


1-5 Schools 6-185 Schools Chi-square Significance 

Make provisions 49 (67.2) 106 (87.8) 
16.9 1/2 95 level 

Do not make provisions 88 (69.8) 13 (91.2) 


By number of students in a school 


32-444 Students 445-1981 Students Chi-square Significance 
Make provisions 60 (72.1) 95 (82.9) 
7.45 1% level 
Do not make provisions 87 (74.9) 74 (86. 1) 
TABLE II 


KINDS OF PROVISIONS MADE FOR THE GIFTED 
———— MM 


By number of elementary schools in the system 
1-5 Schools 6-185 Schools Chi-square Significance 


------------------------------------------------------------:--:------:-:---:: 


Complete segregation 4 (2.85) 5 ( 6.15) 

Partial segregation 10 (13. 28) 32 (28. 72) 1.62 Not significant 
Acceleration 6 (7. 27) 17 (15. 73) .45 Not significant 
Enrichment 42 (39.83) 84 (86.17) .92 Not significant 
Extra-curricular 16 (21.18) 51 (45. 82) 3.26 Not significant 


By number of students in a school 
32-444 Students 145-1981 Students Chi-square Significance 


a 


Complete segregation 1 (3. 48) 8:05:52) 

Partial segregation 22 (16. 26) 20 (25. 74) 4.54 5% level 
Acceleration 8 (8.90) 15 (14. 10) 17 Not significant 
Enrichment 45 (48.77) 81 (77. 23) 2.54 Not significant 
Extra-curricular 19 (25. 94) 48 (41. 06) 5.34 2 1/2 % level 


I, 


TABLE III 


METHODS OF IDENTIFYING THE GIFTED 


= 


By number of elementary schools in the system 
1-5 Schools 6-185 Schools Chi-square Significance 


a eee UR c6 даан ЕЗ 


Individual intelligence test 24 (26.55) 60 (57. 45) -78 Not significant 
Group intelligence test 36 (41. 73) 96 (90. 27) 7.43 195 level 

Achievement test 42 (42. 68) 93 (92. 32) .12 Not significant 
School grades 28 (26. 87) 57 (58. 13) .15 Not significant 
Teacher judgment 42 (43.31) 95 (93. 69) .50 Not significant 


By number of students in a school 


32-444 Students 445-1981 Students Chi-square Signif icance 
EAR uos ou em 3 a 


Individual intelligence test 34 (32.52) 50 (51.48) .24 Not significant 
Group intelligence test 50 (51. 10) 82 (80. 90) .26 Not significant 
Achievement test 53 (52. 26) 82 (82. 74) 243 Not significant 
School grades 28 (32.90) 57 (52. 10) 2.64 Not significant 


Teacher judgment 51 (53. 03) 86 (83. 97) 1.09 Not significant 


(4516 


НОЧУЧ5ЧЯ 'IVNOLLVOQGS ЯО 'IVNHOOP JHL 


Lack of adequate finances 
Lack of physical facilities 
No planned program needed 
Lack of qualified teachers 


Not enough gifted children 


TABLE IV 
REASONS FOR NO PLANNED PROGRAMS 


=_———————Є—Є————ї—ї——————Є—Є—Єїү{&—Є—Є—Є—ЄүЄ&ү———— 


By number of elementary schools in the system 
1-5 Schools 6-185 Schools Chi-square Significance 


u—Ó——À—————M———Á— Ó—M—Á сы ыш іс. 


44 (32. 80) 16 (27.20) 
24 (21.86) 16 (18.14) 
12 (17.49) 20 (14.51) 
11 ( 8.75) 5( 7.25) 

4( 8.75) 12 ( 7.25) 


Ву number of students in a school 
i-square 


Lack of adequate finances 
Lack of physical facilities 
No planned program needed 


Lack of qualified teachers 
Not enough gifted children 


32- Students - Students 
34 (32.42) 26 (27. 58) 
22 (21. 61) 18 (18. 39) 
14 (17.29) 18 (14. 71) 
т ( 8.65) 9( 7.35) 
8 ( 8.65) 8( 7.35) 


13.45 


.27 
.02 
1.70 


-76 
.12 


1/2% level 

Not significant 
5% level 

Not significant 


2 1/2% level 


Significance 


Not significant 
Not significant 
Not significant 


Not significant 


Not significant 


auna 
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tems. Although lack of physical facilities and lack 
of qualified teachers were frequently listed as rea- 
sons, there were no significant differences between 
the small systems and the large systems in the 
number of times they were listed. 

There were no Significant differences between 
the small schools and the large schools in any of the 
reasons given. 

The schools which did not have planned programs 
for the mentally superior were further asked if they 
planned to initiate such programs. Table У shows 
the responses to this question. A total of 42 schools 
stated that they intended to start programs and 119 
schools stated that they did not intend to start pro- 
grams. There were no significant differences be- 
tween the large systems and the small systems or 
between the large schools and the small schools in 
the responses to this item. 

All 316 of those who returned the questionnaire 
were asked to check the method or methods that they 
would favor for the mentally superior child if there 
were no limiting factors such as finances, space, 
etc. The results of this item are shown in Tables 
VI, VII, VIII, and IX. 

Table VI shows the total of all responses. En- 
richment, extra-curricular activities, and partial 
segregation were the methods that were most com- 
monly checked. However, there wereno significant 
differences in system size or school size inthese 
responses. 

Table VII again shows the total of all responses. 
Here, however, the responses of the 155 schools 
which had programs were compared with the re- 
sponses of the 161 schools which did not have pro- 
grams, without regard to system size or school 
Size. There were no significant differences be- 
tween the schools which had programs and the 
Schools which did not have programs inthe kinds of 
provisions they would like to make. 

Table VIII shows only the preferences of the 155 
Schools which had programs atthe time of the study. 
These were compared by sy stem size and school 
Size. In four instances, the chi-squares were not 
computed since the theoretical frequencies were be- 
low five. Of the schools which had planned рго- 
grams, the schools in large school systems were 
significantly more likely to want partial segregation 
than were the schools in small school systems. 
There were no significant differences by system 
size or school size in the other indicated preferen- 
ces. 
Table IX shows only the preferences of the 161 
Schools which did not have programs. Again, in 
four instances, the chi-squares were not computed 
Since the theoretical frequencies were below five. 
Of the schools which did not have planned programs, 
the schools with the greater number of students 
were significantly more likely to want enrichment 
than were the schools with fewer students. There 
were no significant differences by system size and 
School size in the other indicated preferences. 


Summary of Findings 


Some significant differences were found when 
Schools in large systems were compared with 
Schools in small systems and when large schools 
were compared with small schools. 

1. Were planned programs offered for the gift- 
ed? Although about half of the schools indi- 
cated that they had planned programs, 

a. elementary schools inlarge systems were 
more likely to have planned programs for 
mentally superior students than elementa- 
ry Schools in small systems. 

b. elementary schools with large student 
enrollments were morelikely to have 
planned programs for mentally superior 
students than elementary schools with 
small student enrollments. 

2. What kinds of programs were provided? А1- 
though enrichment in the regular classroom 
was the overwhelming choice of all schools, 
a. small schools used partial segregation 

more often than large schools. 

b. large schools used extra-curricular ac- 
tivities more often than small schools. 

3. What methods of identification were used? 
Although teacher judgment, achievement tests 
and group intelligence tests were widely used 
for identification, 

a. Schools in large systems used group intel- 
ligence tests to a greater extent than 
Schools in small systems. 

4. What were the reasons that some schools did 
not have planned programs? Although no 
single reason was given by a majority of all 
Schools reporting, 

a. schools in small systems listed inadequate 
finances more often than schools in large 
Systems. 

b. schools in large systems listed no planned 
program needed or lack of gifted children 
more often than schools in small systems. 

5. What programs would be preferred if there 
were по limiting factors? Although enrich- 
ment, extra-curricular activities, and par- 
tial segregation were preferred by over half 
of all schools, 

a. of the schools which had programs at the 
time they reported, schools in larger sys- 
tems preferred partial segregation to 4 
greater degree than schools іп small sys- 
tems. 

b. ofthe schools which didnot have programs 
at the time they reported, large schools 
preferred enrichment to a greater degree 
than small schools. 


Interpretation of Findings 


Although this study was limited to the state of 
Michigan, care was taken to insure that the results 
were representative of school size and system size 
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TABLE V 


INTENTIONS TO START PROGRAMS FOR THE GIFTED 
———————ÉÓÓÉÉááÁááÉ——————-J.]"í«4.]7Piioic—— 


By number of elementary schools in the system 
1-5 Schools -185 Schools Chi-square Significance 


——————ÉÁÉÉ——————M— MM ——— ———ÁÉÁ———————————————- 


Intend to start a program 26 (22.96) 16 (19. 04) 
1.20 Not significant 
Do not intend to start a program : 62 (65.04) 57 (53. 96) 


By number of students in a school 


- Students - Students Chi-square Significance 
————————————— a E ee ьш 
Intend to start a program 21 (22.70) 21 (19. 30) 

.38 Not significant 
Do not intend to start a program 66 (64. 30) 53 (54. 70) 
TABLE VI 


PREFERRED PROGRAM IF THERE WERE NO LIMITATIONS 
(Total of АП Questionnaires) 


ЕЕН 


By number of elementary schools in the system 
1-5 Schools 6-185 Schools Chi-square Significance 


————————————————————————————————M———————————— 


Complete segregation 7 ( 8.24) 12 ( 10. 76) "SD Not significant 
Partial segregation 75 (75.87) 100 ( 99. 13) .04 Not significant 
Acceleration 8 ( 7.37) 9( 9.63) .01 Not significant 
Enrichment 91 (93. 65) 125 (122.35) .42 Not significant 
Extra-curricular 91 (90. 18) 117 (117.82) .04 Not significant 


By number of students in a school 


32-444 Students 445-1981 Students Chi-square Significance 
CNN uL А жилы ент ее дилемма ор Ni or 20 
Complete Segregation 9( 8.84) 10 ( 10. 16) .01 Not significant 
Partial Segregation 78 ( 81.41) 97 ( 93.59) . 60 Not significant 
Acceleration 8( 7.91) 9( 9.09) .01 Not significant 
Enrichment 97 (100. 48) 119 (115. 52) X Not significant 
Extra-curricular 94 ( 96. 76) 114 (111. 24) . 43 Not significant 
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TABLE УП 


PREFERRED PROGRAM IF THERE WERE NO LIMITA TIONS 


(Those who had programs compared with those who had no programs) 


Those who now Those who do not Chi-square Significance 

have programs have programs 
——————————————————————————ÁÀÁÉÁsÁ e$ 
Complete segregation 11( 9.32) 8( 9.68) .63 Not significant 
Partial segregation 83 ( 85.84) 92 ( 89. 16) .41 Not significant 
Acceleration 9( 8.34) 8( 8.66) 11 Not significant 
Enrichment 105 (105. 95) 111 (110. 05) .05 Not significant 
Extra-curricular 107 (102. 03) 101 (105. 97) 1.39 Not significant 
Ee | 

TABLE VIII 


PREFERRED PROGRAM IF THERE WERE NO LIMITATIONS 


(Those who had programs) 


By number of elementary schools in system 


-5 Schools - Schools Chi-square Significance 
Zomplete segregation 5( 3.48) 6 ( 7.52) 
-artial segregation 20 (26. 24) 63 (56. 76) 4. 68 5% level 
\cceleration 2 ( 2. 85) 7 ( 6.15) 
;nrichment 34 (33.19) 71 (71. 81) .09 Not significant 
:Xtra-curricular 39 (33.83) 68 (73.17) 3.73 Not significant 


By number of students in a school 


32-444 Students 445-1981 Students Chi-square Significance 
тңг EE ST obs 1020 cUm 


omplete segregation 3 ( 4.26) 8 ( 6.74) 
artial segregation 27 (32.13) 56 (50. 87) 2.88 Not significant 
>celeration 2 (3.48) Т ( 5.52) 
irichment 43 (40. 65) 62 (64. 35) .68 Not significant 
tra-curricular 42 (41. 42) 65 (65. 58) .04 Not significant 


——————————— селк ee ee RR сы: Mus эй 


TABLE IX 


PREFERRED PROGRAMS IF THERE WERE NO LIMITATIONS 


(Those who did not have programs) 


M ————— 


By number of elementary schools in system 


1-5 Schools 6-185 Schools Chi-square Significance 
Complete segregation 2 ( 4.37) 6 ( 3.63) 
Partial segregation 55 (50.29) 37 (41. 71) 2:21 Not significant 
Acceleration 6 ( 4.37) 2 ( 3.63) 
Enrichment 57 (60. 67) 54 (50. 33) 1.53 Not significant 
Extra-curricular 52 (55. 20) 49 (45. 80) 1.10 Not significant 

By number of students in a school 
32-444 145-1981 Chi-square Significance 

Complete segregation 6 ( 4.32) 2 ( 3. 68) 
Partial segregation 51 (49. 71) 41 (42.29) kt Not significant 
Acceleration 6 ( 4.32) 2 ( 3.68) 
Enrichment 54 (59. 98) 57 (51.02) 4.17 5% level 
Extra-curricular 52 (54.58) 49 (46. 42) Sr Not significant 
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for all the elementary schools in that state. Inso- 
far as these schools are representative of elemen- 
tary schools in other states, the results are applic- 
able to other states. 

The increasing interest in the gifted may lead to 
more planned programs for the gifted. If, as was 
shown in this study, larger schools and larger sys- 
tems are more likely to make such provisions, there 
is an additional factor which is working toward such 
programs, There is a tendency throughout the coun- 
try for very small schools and small systems to 
consolidate into larger units. This increased size 
through consolidation may r es ult in more adequate 
programs. 

Examining the reasons for no planned programs 
gives additional emphasis to this conclusion. 
Schools in larger systems arelesslikely to give in- 
adequate finances as a reason for lack of such pro- 
grams. Hence, it would appear that the financial 
limitations which the small systems consider a ma- 
jor block to the organization of programs would be 
less important if there were larger school units. 

There is no clear-cut ‘‘best’’ program for all 
gifted children. Each of the various programs has 
its proponents and opponents. It is note worthy, 
however, that acceleration was listed as adesirable 
method by only five percent of all schools. There 
is a considerable body of rese arch evidence which 
indicates the worth of thoughtful, moderate acceler- 
ation. It seems that this evidence has not been suf- 
ficiently transmitted to our elementary schools. 


Summary ^ 


Three hundred and sixteen s chools replied to a 
questionnaire concerning mentally superior child- 
ren. These schools were representative of school 
size and system size for all elementary schools in 
Michigan. 

Larger schools and schools from larger systems 


were more likely to have planned programs for the 
gifted. Small schools used partial segregation more 
often than large schools, while large schools used 
extra-curricular activities more often than small 
Schools. Although a wide variety ofidentific ation 
procedures were used, schools іп large systems 
used group intelligence tests to agreater extent 
than schools in small systems. Schools in small 
Systems were more likely to feel hindered by inade- 
quate finances in establishing programs while 
Schools in larger systems were more likely to feel 
that they did not need a planned program or that they 
had too few gifted to warrant it. 
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A Study of the Relationship Between Aspirational 


Levels and Academic Achievement 


IN THE PRESENT study, an attempt was made 
to determine the degree of relationship between ~ 
some measures of aspiration derived from an adap- 
tation of Cassell’s Level of Aspiration Test* and 
academic achievement. It was speculated that an 
aspiration test related to academic achievement 
might be a valuable addition to a test battery ina 
scholastic situation. 

The relationship between intelligence and aca- 
demic achievement has been generally recognized 
(1, 3, 4, 5, 7, 12); therefore, intelligence test re- 
Sults are often used as predictors of academic suc- 
cess. It seems evident, however, that other vari- 
ables, such as emotional behavior, motivation, and 
aspiration influence a student's academic achieve- 
ment (4, 6, 8, 12, 14). 

The subjects used in this study were 101 fresh- 
men and sophomore university students who had . 
completed at least 12 semester hours of course 
work. 


Procedure 


Tests administered in this study were an adap- 
tation of Cassell's Level of Aspiration Test, Otis 
Quick Scoring Mental Ability--Gamma and Revised 
Beta. 

The aspiration test is a goal discrepancy mea- 
Sure which also includes induced frustration. On 
Cassell's test, the subject was asked to estimate 
the number of X's he could mark on a form sheet 
inthirty seconds. The subject was then asked to 
make the X's and was timed for thirty seconds. The 
Subject then scored this subtest. Тһе scoring SyS- 
tem is so devised as to cause the subject to be pun- 
ished, through his score, for failure to estimate 
correctly. The subject was then asked to repeat 
the foregoing procedure for Subtest II. Subtest III 
Was presented as the same situation. However, in 
Subtest Ш the subject was allo wed twenty seconds 
to complete the task even though he was led to be- 


* The adaptation made with the pérmission of author. 
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lieve he had thirty seconds and made his estimate 
on this basis. Failure was thus induced andthe re- 
action of the subject was then noted in Subtest IV 
which was conducted in the same way as Subtests I 
and П. 

Scores were derived from the subject’s esti- 
mates (E) and revisions and the actual number of 
items completed (C) in the subtests. 

One score used was the difference between the 
number estimated in the second subtest and the 
number completed in the first subtest (E2-C1). A 
second score used was the difference between the 
estimate for the third subtest andthe number com- 
pleted in the second subtest (E3-C2). The third 
Score used was the difference between the first 
Score used in this study (E2-C1) and the second 
score (E3-C2). The fourth score used was the dif- 
ference between the subject's estimate for the fourth 
subtest, made after the short timed subtest, and the 
estimate he made for the third subtest (Е4- ЕЗ). 
This should give some indication of his.reaction to 
frustration. The fifth score used was the difference 
between the fourth estimate and the number com- 
pleted in the third subtest (E4-C3). Thesixth score 
was the difference between the number actually 
completed in the fourth test and the number com- 
pleted in the second test (C4-C2). This was com- 
puted to measure to what extentactual performance 
changed after frustration. Three other scores used 
from the aspiration test were the original estimate 
(E1), the number completed in the first subtest (C1), 
and the total on the fourth subtest (T4). 

Grade point averages were computed for each 
subject on the basis of course work done in the se- 
mester in which the tests were administered. If the 
subject was not enrolled in atleast 12hours of 
course work in that particular semester, then 
grades were recorded forlater semesters to total 
at least 12 hours. If it was necessary to use any 
part of a semester’s grades, then all grades for 
that particular semester were used incomputing 
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that individual's grade point average. Hence, some 
Students' averages werebasedon morethan12 hours 
Of course work, but none onless than 12 hours. 
Four quality points were given for each hour of A 
grades, 3 for В, 2 for С, 1for р, OforF, Ofor in- 
complete, and 0 for withdrawal. 


Results 


Tetrachoric correlation coefficients were com- 
puted between each of the aspiration level scores 
and quality point average, scoreonOtisQuick Scor- 
ing Test of Mental Ability--Gamma, score on Re- 
vised Beta, grade in freshman English, psychology 
and history. Correlation coefficients were also cal- 
culated between each score derived from the aspi- 
ration test and every other score derived from the 
aspiration test. In addition, coefficients of corre- 
lation were computed between the scores on the two 
intelligence tests, Otis Gamma and Revised Beta, 
and the students’ grades in freshman English, psy- 
chology, and history, as well as between IQ scores 
and total grade point averages. 

Table I shows the intercorrelations between var- 
iables derived from the adaptation of Cassell's 
Levelof Aspiration Test. The high positive corre- 
lation coefficient, +.79 between E4-E3 and E4-C3 
was anticipated, since both measures utilize the 
fourth estimate (E4). Too, E3 and C3, the numbers 
subtracted from E4 in each case, aresimilar, since 
through learning from earlier subtests, the sub- 
ject's estimate for the third subtest was near the 
number completed in the third subtest, even though 
subtest III was timed for 20 seconds instead of the 
30-second timing used in the previous subtests. The 
correlation coefficient of -. 74 between E2-C1 and 
(E3-C2) - (E2-C1) was spurious since both variables 
have in common the term (E2-C1). The same was 
true of the +. 40 coefficient of correlation between 
E3-C2 and (E3-C2)- (E3-C1). The correlation co- 
efficient of +. 47 between E4-E3 and C4-C2 indicates 
that there was a tendency for actual performance 
after frustration to follow the trend of the estimate 
after frustration. If the subject estimated he would 
do better on the subtest following frustration, then 
his actual performance tended to increase. If he 
estimated lower, the number completed tended to 
be less. 

Correlation coefficients between measures de- 
rived from the aspiration test and scores from the 
two intelligence tests, Otis Quick Scoring Test of 
Mental Ability —Gamma, and Revised Beta, show 
no high relationships. Тһе highest correlation was 
*. 26 between the original estimate onthe aspiration 
test and IQ on Revised Beta. This may indicate a 
slight tendency for those persons with a higher non- 
verbal intelligence quotient to be better capable of 
estimating their ability to perform a non-verbal 
ask. Other coefficients were insignificant. 

Table II shows the coefficients of correlationbe- 
ween the two intelligence tests —Otis Quick Scoring 
l'est of Mental Ability —Gamma and Revised Beta— 


and grades in English 131, Psychology 131, and His- 
tory 135, as well as total grade point average. In- 
telligence quotients on the Otis test showed high 
positive correlations with these measures of scho- 
lastic achievement, the highest positive correlation 
being +. 62 with Psychology 131. Intelligence Quo- 
tients from Revised Beta showed slightnegative 
correlation coefficients with grades in English 131 
and History 135, though a low positive correlation 
(+. 11) with total quality grade point average(G. P. 
А.). The correlation coefficient between Re vised 
Beta and Psychology 131 was +. 24, which was the 
highest positive correlation coefficient between Re- 
vised Beta and a measureof scholastic achievement. 

Coefficients of correlation between measures of 
Scholastic achievement and measures derived from 
Cassell's Level of Aspiration Test were computed. 
There seems to be little relationship between the 
two. In order to determine the value of these aspi- 
ration measures towardthe prediction of academic 
success, the two measures which would contribute 
most to a multiple correlation were entered into the 
Wherry-Doolittle Test selection method. The Otis 
alone had a correlation coefficient of +.57 with grade 
point average. The measure of aspiration which 
contributed most to the multiple correlation (C4-C2) 
raised this to +. 62. Addition ofthe second measure 
of aspiration (E4-E3) raised this to +. 63. 

On Table III, subjects are classified according 
to the estimates that were made on subtests II and 
Ш of the aspiration test. These estimates were 
compared with the performance on the subtest im- 
mediately preceding it (E2-C1 and E3-C2). If the 
estimate was higher than previous performance, the 
subject was rated +, if lower, -, and ifthe same, 0. 
Then E2-C1 was subtracted from E3-C2. Тһе re- 
sulting number was either +, -, or 0. All possible 
combinations of these three variables, E2-C1, E3- 
C2, and (E3-C2) - (E2-C1), were recorded. 

These classifications gave some indication of the 
psychological processes of the individual. If, as in 
the first classification on Table IH (+++), the sub- 
ject estimated that he would complete more X's on 
the second subtest than he had on Subtest I (E2-C1) 
and estimated that he would complete more X's on 
Subtest III than he had on Subtest II (E3-C2) and the 
there was an increase inthe number of X's estimat- 
ed above previous performance for the subsequent 
test, (E3-C2) - (E2-C1), it was assumed that the 
subject showed a trend toward optimism (as oppos- 
ed to pessimism). Trends can be noted in the first 
nine classifications. The last four do not seem to 
show a definite pattern, but are simply classifica- 
tions of subjects. 

The total number in each classification is 50 
small that comparisons areinconclusive. However, 
comparisons between those subjects whose esti- 
mates were more in keeping withtheir performance 
(realistic) and those whose estimates were not in 
keeping with their performance (unrealistic) reveal 
no appreciable difference in rank on quality point 
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TABLEI 


INTERCORRELATIONS OF VARIABLES DERIVED FROM AN ADAPTATION 
OF CASSELL'S LEVEL OF ASPIRATION TEST 


Е2-СІ ЕЗ-С2 (E3-C2)-(E2-C1) Е4-ЕЗ E4-C3 


E3-C2 +.27 

(E3-C2)-(E2-C1) -."4 +.40 

E4-E3 -.05 -.05 м 0.06 

E4-C3 -.29 +. 30 -.11 +.79 

C4-C2 -.21 +.19 +. 24 +.47 +,14 


--------------------------------------------------:--:-:-:-:-:----: 


TABLE II 


COEFFICIENTS OF CORRELATION BETWEEN SCORES ON 
INTELLIGENCE TESTS AND MEASURES OF 
SCHOLASTIC ACHIEVEMENT 


————————=——————————— Єў 


Otis Revised Beta 
Nose mc xk ————— S 
English 4.46 -.02 
Psychology 4.62 4.24 
History 4.35 -.12 
Q. P. A. +.57 +11 
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TABLE III x 


DISTRIBUTION BY TYPES OF ESTIMATES ON AN ADAPTATION OF 
CASSELL'S LEVEL OF ASPIRATION TEST 


—————————————Є—Є—Є—Є—Є—Є——Є—Є—Є{]у———Є—Є—Єї©ү————-=- 


“кес do AP ар АС. Otis Revised Beta 
© 
с gà . H4-50% ` H6 -75% H6 -75% | 
E. L4 L2 L2 
пто 6 H4 67% H2 33% H2 33% 
к ^ L2. L4 2214 Р 
ug Mc H9 43% H9 43% ^ HI2-57% 
1127 112 19 
40-7 14 H8 57% H6 493 — H6 43% 
L6 L8 18 
000: 7 - H4 579 H4 57% H4 57% 
+ 13 L3 L3 
4 Ж 
-0+ 2 Hi 50% ні 50% H2 100% ` 
| 11 Li LO 
* 4 є 
аео g 6 H3 50% Hi 16% Hi 16% 
L3 L5 L5 
250 1 HO -- Hl) --% HOt = 
L1 10. 54 L1 
m 3 H2 679 H2 67% H2 679 
Li Li L1* 
0++ 6 H2 33% H2 33% H3 50% 
L4 L4 L3 
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ж 
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average. 
fer substantially as related to grade point average. 


Summary and Conclusions 


An adaptation of Cassell's Level of Aspiration 
Test, the Otis Gamma, and the Revised Beta were 
administered to 101 freshman and sophomore stu- 
dents. Scores on these tests and grade point aver- 
ages were compared. Тһе highest correlation 
(+. 62) was between freshman psychology grades and 
the Otis test. Тһе Otis correlated +.57 with grade 
point average. АП correlation coefficients and 
comparisons between measures from the aspiration 
test and measures of academic achievement showed 
that little relationship was evident between the two. 
“Оп the basis of these findings, it would appear that 
the inclusion of this adaptation of Cassell's Level of 
Aspiration Test in a battery designed to predict ac- 
ademic achievement would not.be economically ad- 


vantageous. М 
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Instructors Learn to Help Students Apply Principles 


A CONTINUING PROBLEM in professional edu- 
cation is that of planningthe program in such a way 
that the fullest possible advantage can be taken of 
what has been learned previously, and particularly 
what students began to learn intheirgeneral educa- 
tion. The solution to this problem would appear to 
lie, in part, in a careful examination of programs 
of professional education. Such examination could 
include the identification of opportunities to maxi- 
mize the relationship betweengeneral education and 
professional education. A major purpose of this 
study has been to analyze the experiences of two 
college instructors who end eavored to become in- 
creasingly skillful in helping student nurses learn 
to apply concepts from sociology and psychology in 
clinical work. 

This task is essentially one of curriculum revi- 
sion. It is generally well-known that anyattempt 
to improve a curriculum involves going through 
some general steps of a rational approach. These 
steps involve; 1) determining objectives, 2) selec- 
tion and organization of learning experiences, and 
3) planning for evaluation. For people inexperienc- 
ed in curriculum work, it is often difficult to carry 
out this logical approach. Relatively little has been 
done to follow closely the activities of those whotry 
systematically to carry outthese procedures. This 
study has attempted to provide a detailed account of 
what actually happened over a period of time to two 
instructors engaged in a particular project of cur- 
riculum improvement. The study thus was a case 
study of a project, with particular reference to two 
instructors who became deeply involved in it. 


The Setting 
The University of Washington School of Nursing 


was currently engaged in a five-year curriculum 
research project inbasic nursing education. 


DOUGLAS A. JOHNSON 


Sacramento State College 


Through cooperative curriculum research, the var- 
ious studies in the project sought to improve the in- 
structional program in basic nursing and to shorten 
the time necessary for the preparation of a profes- 
Sional nurse. 4A three-year investigation, exploring 
the ways of relating the basic social and natural 
Sciences to clinical nursing, supplemented the more 
comprehensive five-year project. 

The three-year investigation, of whichthis study 
was a part*, was seeking answers to the following 
questions: 

1. What are the major concepts that all univer- 
sity students should begin to learn in their general 
education courses? 

2. Which of these basic concepts f rom general 
education should and can be broadened and deepened 
in professional nursing courses? 

3. How can nursing students be hel ped to apply 
these concepts from general education that have im- 
plications for professional practice? 

4. How can we evaluate to find outto what extent 
the nursing courses are helping students apply con- 
cepts from general education in solving nursing 
problems? 

5. What kind of curriculum pattern will facili- 
tate relationships between general and profession- 
al education? 

The primary concern of this study has been with 
the first three questions, and in particular, ques- 
tion three. 


Specific Questions to be Answered 


In approaching the problem of this investigation, 
the following specific questions were formulated: 

1. What are some of the difficulties that occur 
as teachers work intensively on the kinds of prob- 
lems involved in helping students apply principles? 

2. What are some ways of overcoming the ob- 
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stacles for the instructors? 

3. What are some factors that support and mo- 
tivate the teachers in the process of increasing their 
skills in helping students apply principles? 

4. What are some ways of capitalizing on and 
strengthening the positive and motivating factors 
for the instructors? 


Sources of Data 


The data for this study were gatheredinthe ped- 
iatric departments of two clinical divisions of the 
University of Washington School of Nursing, the 
Harborview-King County Division and the Tacoma 
Indian Memorial Division. This choice was made 
because the writer was affiliated with the Pediatric 
Committee of the School of Nursing. Two pediatric 
nursing instructors, who were members of this 
committee, agreed to participate in the study. 
These two were in charge of two classes with a to- 
tal of twenty-eight students. The writer worked 


with the instructors involved inthis study before the _ 


winter quarter, when the intensive researchwas 
carried on. 

The two previous quarters had been devoted to 
exploration of the concepts from sociology and psy- 
chology, content aspects of objectives for pediatric 
nursing courses, and the de v elopment of a course 
in human development taught by another instructor. 
This work was done in Pediatric Committee meet- 
ings. Some data were gathered directly from the 
Students. Sources of data included discussions 
with nursing faculty, minutes of planning sessions 
with faculty members, instructor's long-range and 
day-to-day written plans, learning experiences in 
the class and on the ward, and student logs. 


Collection and Analysis of Data 


Recordings of planning sessions withfaculty, an- 
alysis of written plans, observations of classes and 
ward practice and studentlogs were the principal 
Sources of data. The methods utilized in collecting 
data from these sources included observation, indi- 
vidual and group interviews, and examination of re- 
cords. Recordings of planning sessions and obser- 
vations proved helpful in att empting to answer the 
two questions which involved identifying the factors 
that obstructed and motivated the instructors’ 
growth. Analysis of written plans and student logs 
furnished other information concerning ways of 
dealing with these factors. 

The instructors and the investigator worked to- 
gether in an attempt to identify means for develop- 
ing their increased skillinteaching. The actual 
Process of cooperative research with the twoin- 
Structors was divided into three increasingly com- 
plex “Чеуе1в”” or phases embracing, altogether, 
twelve weeks. Each phase was approximately four 
weeks in length and called for increasingly complex 
types of planning and teaching behavior on the part 
9f the instrüctors. 


As the data accumulated, itbecame apparent that 
the motivating factors or obstacles tended to fall in- 
to the following categories: 1) knowledge of th e so- 
cial sciences, 2) understanding of tasks to be done 
in curriculum improvement, 3)interpersonal rela- 
tions and personal factors, and 4) administrative 
factors. These categories became the major foci 
of the study. 


Major Findings 


The major findings of the study are here sum- 
marized for each of the four categories. 

Knowledge of the Social Sciences. In this cate- 
gory the major obstacle identified was the instruc- 
tors’ lack of knowledge of social science concepts. 
This lack was revealed by f requent statements by 
the instructors, by the difficulties they experienced 
in utilizing certain types of instructional materials, 
and by difficulties encountered in writing ‘‘c ases” 
involving the application of social science concepts, 
At the same time, as the study progressed, the 
lack of knowledge also served to motivate the two 
instructors to become more adequately prepared in 
this area. The instructors' willingness to listen to 
recordings, read books, attend committee meetings, 
participate in conferences, and change their teach- 
ing through the use of a variety of materials, dif- 
ferent teaching methods, and resource people pro- 
vides evidence for this finding. The data concern- 
ing each of these activities contain many comments 
from the instructors relative to “һоу much social 
Science they were learning" and how their skill in 
helping students apply this material in case studies 
and other learning experiences related to pediatric 
nursing situations was increasing. 

Certain procedures appeared to be particularly 
helpful in removing the obstacle imposed by lack of 
knowledge and in capitalizing onthe instructors' de- 
Sire to learn more. One of them was the use of re- 
Source persons (a psychologist and mental health 
teacher, a sociologist, and the like) who were 
called in sometimes to work directly with the stu- 
dents and other times to work directly with the in- 
structors. A second practice found to be useful 
was that of having each instructor planspecific 
learning situations designedtohelp students acquire 
the skill of applying social science principles, A 
third useful procedure involvedscheduling meetings 
where the instructors could study instructional ma- 
terials in terms of their social science content and 
implications for the instructors' classes. 

Understanding of Curriculum and Instruction, 
The major obstacles in the area of curriculum and 
instruction were found to be thefollowing: 1) inade- 
quate conception of planning, 2) difficulty in trans- 
lating overall objectives into operational goals, 3) 
limited instructional materials, 4) mechanical 
learning experiences, and 5)satisfaction with pres- 
ent methods of teaching. 

Among the supporting factorsin this category 
were some that would have been present in the sit- 
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uation even if the investigator had not been engaged 
in the research: 1) the emotional support and tech- 
nical help of the small friendly group of colleagues 
on the Pediatric Committee, 2) the total faculty 
group participating in the Curriculum Research 
Project creating the expectancy of performance and 
assimilating the results of the instructors’ work, 3) 
the investigator's work with the Pediatric Commit- 
tee as a member of the Research Staff prior to the 
study, and 4) opportunities for the instructors to 
see the national implications of their work through 
conferences with national leaders. It was what the 
investigator did with these factors that caused them 
to support the research. 

For example, though thetotal faculty group cre- 
ated an expectancy of performance and assimilated 
the results of the instructors’ work, it was neces- 
sary to interpret to the instructors what kinds of 
things the faculty expected, help set deadlines for 
producing certain results, and the like, in order to 
capitalize on this supporting factor. In the case of 
the investigator’s work with the Pediatric Commit- 
tee prior to the study, the supporting factor was 
there, but in order to use it, the investigator had to 
to use what he found out about t he instructors’ 
strengths and weaknesses during the previous work 
with them. They, in turn, hadsomebasis on which 
tc*judge the investigator’s actions and suggestions. 
The meaning the investigator attaches tothe factors 
in this category is thatinidvidual teachers can be 
helped to gain greater skillin working on curricu- 
lum tasks if the potent motivating factor of working 
as part of a total faculty conce rned with the same 
or similar problems is present. 

Another factor capitalized on was that of having 
а curriculum consultant from another field working 
closely with the group of instructors. Even though 
the investigator, and others who worked occasion- 
ally with the instructors, were notpreparedin ped- 
iatric nursing, their services were useful to the in- 
Structors. These people helped the instructors see 
certain curriculum issues involved in situations and 
how the issues could be resolved. Still another 
factor that was used to support the instructors’ at- 
tempts to become more proficient in various as- 
pects of curriculum and instruction was that of the 
problem areas in which they were working being 
ones about which they were sufficiently concerned 
to want to take some action. The selection of spe- 
cific problems and the formulation of goals and 
ways of reaching the goals were basic to the study. 

Throughout the data reported in achapter of this 
study, evidence occurred again and again to indicat e 
that these instructors moved much faster in this 
short period of time where they actually participat- 

ed in research than they had in the preceding quar- 
ters when they had been observers. Student per- 
ceptions of what is actually occurring constituted 
another potent factor in helping instructors improve. 
The cooperative development of goals and way s of 
achieving the goals, as well as the evaluation of the 


attainment of the goals by students, in tructors, 
and consultants was animportant element in the im- 
provement process. 

A further consideration in this category isthat 
participants in a study of this nature must have a 
clear understanding of thelearningprocessin order 
to formulate and test for themselves the meanings 
of the experiences they have during the research. 
Instructors A and B worked on such varied curric- 
ulum tasks as formulation of objectives, experi- 
mentation throughout the three phases of the study 
with а variety of teaching methods and the use оға 
variety of teaching materials, evaluation procedure 
dures and the like, Theinstructors did notseem to 
be able to formulate and test for themselves the 
meanings of the experiences they were having during 
the three phases of the study until they began to 
study and discuss what these experiences meant to 
them in terms of the School of Nursing's theory of 
learning. 

Interpersonal Relations and Personal Factors. 
The major obstacles in thi s area were found to be: 
1) insecurity arising from one of the instructor's 
feelings that she had introduced new teaching ideas 
too rapidly for the class, 2) lack of warmth and a 
feeling of competition bet ween the two instructors 
involved in the study, 3) lack of understanding of the 
real problem of this investigation and the instruc- 
tor's role іп it, and 4) the con sultant's lack of un- 
derstanding and resulting impatience with aggres- 
Sive behavior displayed by one of the instructors. 

Among the supporting and m otivating factors in 
this area were: 1) the instructors had worked to- 
gether over many years and knew each other's 
strengths and weaknesses, 2) the instructors’ ac- 
ceptance of the investigator observing in their 
teaching situations, 3) the consultant's awareness 
of the instructors' needs for satisfaction and re- 
ward in their research work, and 4) the usually 
sound rapport between the investigator and the in- 
structors. 

A useful procedure for overcoming the obstacle 
of insecurity from aninstructor’s feelings that she 
had introduced new teachingideas too rapidly for 
her class involved the investigator sitting down with 
the instructor and reading student logs to determine 
whether or not she had moved too fast for the stu- 
dents. This type of cooperative evaluation of the 
situation aided the instructor in examining the facts 
more objectively than she had previ ously been able 
todo. The presence of someone in this study who 
would be sympathetic to personal problems and ad- 
ministrative annoyances and who would also help 
“е group’’ keep a balance bet ween their expres- 
sion of emotion and the jobsto be done was a valu- 
able factor in overcoming the obstacles of insecur- 
ities and lack of friendliness and feelings of com- 
petition between the instructors. (The term ‘‘emo- 
tion" is used here in the non-technical sense. The 
writer views it as meaning strong feeling, any one 
of the states designated as fear, anger, disgust, 
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grief, joy, surprise, and thelike.) Asinterper- 
sonal tensions arose, shifting the‘‘working climate” 
toward greater informality (in this case luncheons, 
evening social-work meetings) helped to refocus the 
efforts of the participants on the job to be done. 

А way of capitalizingonthe investigator's aware- 
ness of ths instructors' needs for satisfaction and 
reward in their research work involved placing the 
instructors in situations whereit was possible for 
them to gain satisfaction through recognition of their 
work as a part of an all-school study. Some of the 
situations concerned the instructors doing the re- 
porting about the study at semi-annual work confer- 
ences and other meetings. This recognition, in 
turn, helped build sound rappo rt between the con- 
sultant and the instructors and helped the instruc- 
tors feel they were involved in worth-while research. 
The fact that the instructors knew each other's 
Strengths and weaknesses made it possible for both 
the consultant and the instructors to move more 
rapidly in their research than if this had not been 
the case. The group knew this and therefore used 
this factor in dividing up committee work in terms 
of each instructor's skills and spending little time 
in exploring members' talents and weaknesses. 

Administrative Factors. Some of the major ob- 
stacles in this category were: 1) problems arising 
from dual appointments, 2) distance of the instruc- 
tors from the campus, 3) lack of adequate transpor- 
tation budgets, and 4) difficulties in obtaining re- 
source people and instructional materials. Іп gen- 
eral, the administration acted as a strong support- 
ing and motivating element to the research. This 
was evident in such means of support as the follow- 
ing: 1) procuring substantial grants-in-aid to pro- 
vide some of the conditions necessary for effective 
work, 2) provision of nationally known consultants 
in nursing and education, 3)travel funds for visits 
to other schools, 4) funds for instructional mater- 
ials, and 5) clearing class schedules so that every 
faculty member had Friday afternoon free for indi- 
vidual work, committee work, or meeting in gener- 
al sessions of the total faculty. 

The problem of one instructor holding a dual ap- 
pointment— part-time instructor and part-time ward 
Supervisor— was not removed. This arrangement 
afforded the instructor muchless time than was ne- 
сезвагу for much sound planning andteaching. The 
difficulty of one instructor feeling “оп the fringe" 
because she was 35 miles away from campus was 
partially overcome by increasing written communi- 
cations, by the investigator spending at least two 
days of each week at her division, and by the con- 
Sultants making sure theinstructor was notified of 
meetings, seminars, and the like. The problem of 
inadequate transportation budget for field trips was 
not Solved. Тһе problem of obtaining resource per- 
Sons and instructional materials was partially 
Solved by the investigator sometimes locating ma- 
terials and resource persons on campus for the in- 
Structors. The positive administrative elements 


were capitalized on in a generalfashionby the group 
in all aspects of the research. 


Conclusions and Implications 


Five major conclusions of this study are: 

1. The most potent obstacle was the instructors' 
lack of knowledge of the social sciences. There 
was no substitute available for those ideas. 

2. The obstacles to the study were such that, 
with varying degrees of success, they were over- 
come. The obstacle growing out of the instructors' 
lack of knowledge of the social sciences was great- 
ly weakened by the instructors learning some of the 
content in this area. It was possible to make rapid 
progress in this area in the three months of inten- 
Sive work. The obstacles arising in th e human re- 
lations areas were diminished to alesser degree 
than those in the social science content field. One 
can not expect a person's basic personality orien- 
tation to change radically in three months work of 
this nature. Some success was gained in overcom- 
ing the obstacles related to curriculum tasks, 
though no great changes were produced inthe short 
period of three months, 

Much effort over a long period of practice and 
experimentation is needed by the novice in order to 
make pronounced progress in fhe curriculum and 
instruction area. То people who аге not accustom- 
ed to thinking in terms of a sound rationale of cur- 
riculum and instruction, progress in this area is a 
difficult task. Some obstacles, though few, were 
overcome or weakened in the administrative area. 
One must recognize the fact that in any project 
there will be some difficulties that can not be elim- 
inated. One would be overcoming a major difficulty 
inthis area if he were to take realistic recognition 
of limitations. Seen in this manner, the limitations 
can be accepted and will not actas a major obstacle 
to the jobs to be done. 

3. Instructors can effectively directtheir atten- 
tion to significant curriculum tasks if the problem 
areas in which they work are ones about which they 
are sufficiently concerned to want to take some ac- 
tion. The problems must be real to the participants 
and not vague generalities. The selection of spe- 
cific problems and the cooperative formulation of 
goals and ways of teaching the goals are basic to a 
curriculum revision program. 

4. А sound curriculum rationale is basic to co- 
operative curriculum study, but the human element 
in the curriculum revision process must be con- 
sidered if progress is to beattained. Interpersonal 
relations, personal factors and administrative fac- 
tors all contribute to the success or failure ofa 
curriculum revision project. 

5. Ifteachers are expected to use the products 
of curriculum research, they must be involved in 
the whole process of arriving at the results. Coop- 
erative action research aidsin developing commit- 
ment to research results and changes of practice in 
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the classroom. 

Someimplications for further study are: 

1. There is little reason to believe that studies 
similar to this one would not encounter similar ob- 
stacles. For example, lack of knowledge of a con- 
tent area. It would be best for those persons who 
would engage in such studies to anticipate and plan 
in advance ways of meeting the obstacles and then 
taking the necessary steps to remove them. 

2. In each of the areas of this study some ob- 
stacles were overcome and some were not. A study 
is needed to determine how these obstacles that 
were not overcome in this research could be over- 
come in a similar situation. 

3. High emotion іп а group is not all bad. In 
many cases it is worth while in that it indicates to 
the leader a high work potential for the group if the 
energy is properly channeled. A skillful group 
leader can use this energy if he anticipates this and 
prepares himself in the techniques of dealing with it. 


4. Other studies, all of which are interrelated, 
are recommended: 


a. Conduct a similar study inpediatric nurs- 
ing with other groups of students and teachers at the 
University of Washington so that large enough num- 
bers of students and teachers can be studied to ob- 
tain more nearly adequate generalizations con- 
cerning the four questions of the study. 

b. Similar studies in other clinical areas at 
the University of Washington School of Nursing. 

c. Similar studies in pediatric nursing andin 
other clinical areas in other schools of nursing. 
(When these studies have been completed, sufficient 
data should have been gathered so that generaliza- 
tions for the total population of schools of nursing 
might emerge. ) 

d. Similar studies in teacher education, en- 
gineering education, legal education and inother 
professional schools. 


ІП Ше JER c.a ve a... 


students themselves, however, were in favor of “рор quizzes" because, as one 
said, “They keep me on the ball." The quizzes did tend to stimulate more 
class discussion, but Selakovich detected a tendency to quibble over minor 
points rather than attack problems of broader significance. 


CAN YOU PREDICT A STUDENT’S PERFORMANCE IN MATHEMATICS ON THE 
BASIS OF HIS ATTITUDE TOWARD THE SUBJECT? Not too well, says 
Cristantiello. He investigated the correlation of ACE Q-scores with mathe- 
matics grades among Fordham University sophomores, found there was a con- 
sistently higher positive correlation among neutral-attitude students than for 
positive or negative attitude students. He also found interesting correlations 
with a student's affinity for a subject other than math. ‘It would seem," the 
author concludes, ‘‘that the use of the ACE Q-score as a predictor of success 
in mathematics should be re-evaluated. ”’ 
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The Effectiveness of Short-term Group Counseling Upon the Aca- 


demic Achievement of Potentially Superior but 


Underachieving College Freshmen 


BOB WINBORN 
North Texas State College 


RECENT NATIONAL and international cir- 
cumstances have focused attention on a portion of 
the population of the United States characterized as 
Superior, talented, or gifted students. Manpower 
shortages, internationalcircumstances, andgrowth 
and change in the scientific, professional, andspe- 
cialized fields have contributed to the increasing 
awareness of the importance of superior students to 
the well-being of the United States. Gardner (1) has 
Suggested that this nation is only beginning to real- 
ize that one of the marks of a modern complex so- 
ciety is the great demand for educated talent. To- 
day, efforts are being made to identify the superior 
Students of this country, to recruit them for institu- 
tions of higher education, to improve their edu- 
cational programs, and to provide various types of 
mediate and long-range programs for their ben- 
efit. 

One of the problems in the development of 
the talented resources of the nation has been the 
failure of many superior students to reach their po- 
tentiality. The Educational Policies Commission 
(2) has stated that the most critical waste of Amer- 
ica’s manpower resources are students of highabil- 
ity who leave school early, who do not go to college, 
or who drop out of college prematurely. Consider- 
able research has been devoted to underachieving 
Superior students in attempts to determine what 
factors cause these students to fail to reachthe lev- 
el of performance of which they are capable and to 
investigate possible ways of assisting superior stu- 
dents to overcome the factors that contribute to un- 
derachievement. However, as Beasley (3) has 
Pointed out in a review of the literature onun- 
derachievement, research in this area has been 
largely inconclusive and often contradictory. 

The increasing importance of superior stu- 
dents to the Society of the United States has created 
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a need for understanding the various problems con- 
comitant with superior students who are undera- 
Chievers. However, the rising enrollments of insti- 
tutions of higher education and the lack of qualified 
personnel have brought an economic problem to bear 
on how to work effectively with large numbers of un- 
derachieving students. Because of the above factors, 
research in the area of group counseling was under- 
taken to determine the value of this process inrais- 
ing the level of academic achievement of potentially 
superior students. 


The Problem 


The problem considered inthis investigation 
was to determine the effectiveness of short-term 
group counseling upon the academic achievement of 
underachieving potentially superior college students. 
The investigation was designed to ascertain whether 
there was a significant difference in academ- 
ic achievement between a selected group of students 
receiving short-term group counseling anda similar 
group of students receiving no counseling. Specifi- 
cally, the procedure followed during the investiga- 
tion was adopted to test the following null hypothe- 
sis: There will be no difference in the academic a- 
chievement of a group of students receiving short- 
term group counseling and a group of students re- 
ceiving no counseling. 


Methods and Procedures 


All freshman students who(1)enrolled for 
the second semester of the 1958-1959 academic year 
at Indiana University, (2) scored a totalscore onthe 
American Council on Education Psychological Exam- 
ination (1952 edition) at or above the 80th percentile, 
and (3) made a first semester grade-point average 
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of 1.50 or below (based on a 3. 0 formula) were con- 
Sidered to be potentially superior students who were 
underachieving in their academic studies. One hun- 
dred thirty-five potentially superior freshmen were 
identified as underachievers. А random sample of 
68 students was drawn from this population of 135 
freshmen and designated an experimental group. The 
remaining students in the population comprised 
a control group. 

Two counselors were responsible for the 
counseling of the experimental group which was e- 
qually divided between the counselors by means of a 
random sample. Each counselor divided his group 
into several smaller groups to facilitate the group 
counseling process, and six smaller groups were 
formed from the experimental group for counseling 
purposes. Both counselors had substantialprior ex- 
perience in counseling and were considered to be 
capable and qualified to conduct the group sessions. 

Prior to ће counseling of the experimental 
group, the California Psychological Inventory was 
administered to both the experimentalandcontrol 
groups. Students inthe experimentalgroup partici- 
pated in a short-term group counseling program 
consisting of six counseling sessions extending over 
a period of approximately two months. Eachsession 
was about one hour in length. The content of the 
Sessions was relatively unstructured, with only brief 
outlines for each sessionbeing usedto assist par- 
ticipants in thinking in terms of improving their ac- 
ademic achievement and to aid in the initiation of 
discussion by members of the group. Following the 
conclusion of the group counseling sessions, the 
members of the experimentaland controlgroups were 
again administered the California Psychological In- 
ventory. 

At the conclusion of the investigation, com- 
plete data were available for 34 males and 25 females 
of the experimental group and for 33 males and 26 
females of the control group. Seventeenfreshmen of 
the original 135 underachievers either declined to 
participate in the investigation or withdrew from the 
university during the semester. 

А description of the entire group of 135 stu- 
dents included in the original population of undera- 
chievers was made on the basis of certain personal, 
Social and academic characteristics. The data for 
this description were obtained from the Counseling 
Center of Indiana University and were of sucha 
nature that refined statistical treatment was not pos- 
Sible. 


Findings 


Following the collection of the data, ап an- 
alysis of the results of the investigation revealed the 
following findings: 


1. No significant differences were evident 
between the experimental and control groups for 
mean ACE total scores, first semester grade-point 
averages, and scores made onthree selected scales 


of the California Psychological Inventory prior to 
the counseling of the experimentalgroup. These 
findings are reported in Table 1. Apparently, the 
randomization of the population into experimental 
and control groups resulted in two comparable groups 
as judged by the factors tested for significant differ- 
ences. The three scales of the CPI were selected 
since they were designed to measure variables as- 
sumed to be associated with achievement. 

Differences between the means and t ratios 
were also computed for the ACE total scores, first 
Semester grade-point averages, and the scores 
made on the three selected scales of the CPI by (1) 
males of the experimental and controlgroups, (2) 
females of the experimental and control groups, (3) 
males and females of the experimental group and 
(4) males and females of the controlgroup. This 
was done to further determine the validity of the ran- 
domization technique. A difference in mean ACE 
total scores between males and females of the con- 
trol group was found to be significant at thefive per 
cent level of confidence. Mean ACE total scores 
were found to be significantly higher for the males 
of the control group than for the females of this 
group. There were no significant differences for the 
other factors tested in the above categories. 


2. After the experimental group had partic- 
ipated in a short-term group counseling program, 
differences in mean grade-point averages between 
the experimental and control groups were found to 
be significant at the five рег cent level of confidence. 
The mean grade-point average of the control group 
was found to be significantly higher than the mean 
grade-point average of the experimental group. No 
significant differences between the experimental апа 
control groups were found inscores made on the 
three selected scales of the CPI. These findings are 
reported in Table II. 


Following the counseling of the experimental 
group, differences between the means and t ratios 
were also computed-for second semester grade-point 
averages and the scores made onthe three selected 
scales of the CPI by (1) males of the experimental 
and control groups, (2) females of the experimental 
and control groups, (3) males and females of the ex- 
perimental group, and (4) males and femalesof the 
control group. Since a significant difference was 
found between the mean grade-point average of the 
experimental group and that of the control group, 
the possibility was examined that this difference was 
affected significantly by either the males or the fe- 
males of the experimental and control groups. NO 
significant differences were found, however, be- 
tween any of the above categories ingrade-pointav- 
erages or Scores made on the three scales of the 
CPI. 

The possibility thatgrades made by counsel- 
ees of either of the two counselors could have had а 
aignificant influence on the grade-point average for 
the experimental group as a whole was also 
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TABLE I 


DIFFERENCES BETWEEN THE MEANS OF THE ACE TOTAL SCORES, GRADE POINT AVERAGES, AND 
SCORES MADE ON THREE SELECTED SCALES OF THE CPI BY THE EXPERIMENTAL AND 
CONTROL GROUPS PRIOR TO COUNSELING THE EXPERIMENTAL GROUP 


Mean Mean Mean Mean 
ACE Mean Ac Ai Ie 
Group Score* GPA Score Score Score 
Experimental 
(N-59) 62.42 1.07 24.29 18.59 38. 73 
Control 
(N-59) 62.68 1.04 25.49 19.34 39. 83 
Difference .26 . 03 1.20 .15 1.10 
t ratio .407 .351 1. 337 „979 1.224 


*ACE total scores represent T scores which were converted from percentiles, GPA equals grade point 
averages, Ac equals the Achievement via Conformity scale of the CPI, Ai equals Achievement via Inde- 
pendence scale of the CPI, IE equals Intellectual Efficiency scale of the CPI. 
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TABLE II 


DIFFERENCES BETWEEN THE MEANS OF THE GRADE-POINT AVERAGES AND 
SCORES MADE ON THREE SELECTED SCALES OF THE CPI BY THE EXPER- 
IMENTAL AND CONTROL GROUPS FOLLOWING COUNSELING OF 
THE EXPERIMENTAL GROUP 


Mean Mean Mean Mean 
Group G.P.A.* Ac Ai Ie 
Score Score Score 
Experimental 
(N = 59) .95 25.39 18.98 39.51 
Control 
(N = 59) 1.20 26.17 19.68 40.37 
Difference .25 .78 -10 .86 
t ratio 2:217 4+ .776 .920 .906 


ж G. P. A. equals grade point average, Ac equals the Achievement via Conformity 
Scale of the CPI, Ai equals the Achievement via Independence scale of the CPI, 
Ie equals the Intellectual Efficiency scale of the CPI. 


** Significant at the five per cent level of confidence. 
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examined. There were no significant differences in 
mean grade-point averages between counselees of 
the two counselors either prior to or following the 
short-term group counseling sessions. 


3. Orientation test results and high school 
grade-point averages of the majority of participants 
in this investigation were not sufficient for the iden- 
tification of these freshmenas underachievers prior 
to their admission to the university. 


4. On the basis of available data, no single 
factor such as age, campus residence, part-time 
employment, or familial background was observed 
to be an apparent contributor to the underachieve- 
ment of the majority of potentially superior students 
who participated in the investigation. 


Conclusions 


The prime reason for this investigation was 
to test the null hypothesis that short-term group 
counseling would not effect, significantly, the aca- 
demic achievement of a group of students who were 
counseled as compared with a similar group of stu- 
dents who were not counseled. This hypothesis was 
not supported by the findings of this investigation. 
In view of the findings, the following conclusions are 
presented. 

1. The students who did not participate in 
the short-term group counseling program made sig- 
nificantly higher grade-point averages than did the 
participants in the counseling program. This indi- 
cates that short-term group counseling tended to 
produce a negative effectupon the academic achieve- 
ment of potentially superior freshmen as reflected 
by their academic grades. 

2. Scores made on three selected scales of 
the CPI which were designed to measure variables 
assumed to be associated with achievement did not 
reflect any significant differences between partici- 
pants in a short-term group counseling program and 
Students who were not counseled. 

3. No particular subgroup, such as males, 


females, or counselees of a certain counselor, ap- 
peared to have caused the significant difference in 
academic achievement between students who were 
counseled and those who did not receive counseling. 

4. The majority of freshmen who were des- 
ignated potentially superior but underachieving stu- 
dents could not be identified as such onthe basis of 
high school grades, orientation test scores, or oth- 
er information available to university officials prior 
to the time these students received first semester 
grades at the university. 

5. Observable factors that could contribute 
to the underachievement of a majority of the poten- 
tially superior freshmen were not in evidence when 
personal, social, and familial data that are usually 
available to university officials were examined. 


The following recommendations for addition- 
al research are based on the findings of this inves- 
tigation: 

1. Research should be designed to develop 
more adequate methods of identifying superior stu- 
dents who are potential underachievers. 

2. Consideration should be given to the 
study of the effectiveness of intensive group coun- 
seling upon academic achievement over anextended 
period of time. 

3. Other methods of assisting potentially 
superior students who are underachievers, such as 
individual counseling, study clinics, and achieve- 
ment workshops, should be considered and e- 
valuated. 
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Special Equipment in Teaching the Gifted 


ACCORDING TO a recent survey" pertain- 
ing to the education of academically superior stu- 
dents, a questionnaire was sentto principals of high 
Schools throughout the United States in which there 
were Merit Certificate Winners in the 1956 compe- 
tition. The purpose was to determine what provi- 
Sions are being made for the academically gifted 
students in high schools and what provisions would 
be made if conditions permitted. Prominent on the 
list of thirty-seven questions was the question: “15 
Special equipment available for use in teaching the 
gifted?" The principals were asked to rate the 
practice suggested by the questioninthree different 
ways: first, in terms of how oftenorhow persist- 
ently the practice was utilized in the high sc hool; 
Second, in terms of how successful the practice was 
found to be when utilized; third, in terms of how 
valuable the practice would be without r e gard for 
the use which was being made of the practice. 

Tables I, П, and Ш show a detailed break- 
down of the responses of the 1, 063 principals who 
returned their questionnaires. Table Iprovides the 
data when the responses to Columns I and П were 
tabulated separately; Table П provides the data 
when the responses to Columns IandI were com- 
bined for tabulation purposes; and Table IN provides 
the responses to the theoretical evaluation. Table 
II gives a more detailed picture of the extent of u- 
tilization of the practice andof the degree of success 
with which it was utilized. 

Table I shows that 59. 5 per centprincipals 
indicated that special equipment was utilized very 
little or not at all in teaching the gifted, that 25.3 
per cent indicated thatit was usedtoa moderate ex- 
tent, and that 7 per cent indicated that it was uti- 
lized to a considerable extent. Furthermore, from 


* Morris J. Applbaum, A Survey of Special Provisions for the Education of Academically Superior Students in 
the United States. (Unpublished Doctoral Dissertation, 
See Phi Delta Kappan, Vol. XXXX, No. 8, May 1959. 


Teachers College, Columbia University) 1958,388 рр. 
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Table I can be gathered that 8.7 per cent principals 
found the use of special equipment in teaching the 
gifted very successful, 28 per cent moderately suc- 
cessful, and 39.8 per cent unsuccessful. Table i, 
however, provides a more revealing picture, Here 
5 per cent of the respondents indicated that the 
practice was utilized to a considerable extent with 
great success and 2 per cent to a considerable ех- 
tent with moderate success. Conversely, 3.2 per 
cent reported that the practice was utilized to a 
moderate extent with great success and 21. 1 per cent 
to a moderate extent with moderate success. 

Table III actually shows what the principals 
think of providing specialequipmentfor use in teach- 
ing the gifted. Althoughthis constitutes a theoretical 
evaluation of the practice, it is of great significance 
when compared with the findings in Tables I and II. 
Here, in Table III, 41.2 per cent of the principals 
responding feel that the utilization of special equip- 
ment in teaching the gifted would be of considerable 
value; 33. 8 per cent feel that it would be of moderate 
value; and 12.3 per cent feel that it would be of lit- 
tle or no value. 


Although it is not the purpose of this repc 7t 
to draw conclusions, itis nevertheless interesting to 
observe that only 12.3 per cent feel that there is no 
need for providing special equipment for use in teach- 
ing the gifted; yet only 7 per cent indicated that taey 
have such equipment available for extensive use. In 
the last analysis, if so many principals feel that it 
would be of considerable value to provide special 
equipment for the gifted, would it not appear to fol- 
low that a much larger number of principals should 
make this special provision for the gifted intheir 
own schools? 
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TABLE I 


FREQUENCY AND SUCCESS OF THE USE OF SPECIAL 
EQUIPMENT IN TEACHING THE GIFTED 


Responses to Column I 
Respondents 
Utilization of Practice 


Number of Respondents Per Cent of Respondents 


Es s pos Jo | n] p ЕЕ 


Respondents Responses to Column II 


Evaluation of Practice 


Number of Respondents Per Cent of Respondents 


meni. | ЕЕЕ: 
mom le) [sie 


Legend 


ColumnI - 
A - utilized to a considerable extent 


B - utilized to a moderate extent 
C - utilized very little or not at all 
NR - no response 


Column П - 
A - very successful 
B - moderately successful 
C - unsuccessful, or successful to only a very limited degree 


NR - no response 
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TABLE II 
SUCCESS BY FREQUENCY OF THE USE OF SPECIAL 


EQUIPMENT IN TEACHING THE GIFTED 


Responses to Columns I and II 
Respondents 


Utilization and Evaluation of Practice 


Number of Respondents 


Per Cent of Respondents 


Legend 


AA - utilized to a considerable extent and very successful 

AB - utilized to a considerable extent and moderately successful 

AC - utilized to a considerable extent and unsuccessful, or succe ssful 
to a very limited degree 

BA - utilized to a moderate extent and very successful 

BB - utilized to a moderate extent and moderately successful 

BC - utilized to a moderate extent and unsuccessful, or successful to 
a very limited degree 

CA - utilized very little and very successful 

CB - utilized very little and moderately successful 

CC - utilized very little and unsuccessful, or successful to a very 
limited degree 

CNR - not utilized, therefore not evaluated 
NR - no response 


Students 


Principals 


Students 


Principals 


TABLE III 
THEORETICAL EVALUATION OF THE PRACTICE OF UTILIZING 
SPECIAL EQUIPMENT IN TEACHING THE GIFTED 


Responses to Column III 


Theoretical Evaluation 


Students 


Principals 


Legend 
А - of considerable value 
B - of moderate value 
C - of little or no value 
NR - no response 
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An Experiment Attempting to Determine the Effectiveness of 
Frequent Testing as an Aid to Learning in Beginning College 


Courses in American Government 


The Problem Do **popquizzes"' help students learn 

American Government? The members of the staff 

in the Government Department at West Texas State 

College in Canyon, Texas, who were teaching the 
beginning course in American National Government 
were concerned with the problem of encouraging 
Students to read textbook assignments in the course. 

A natural assumption which was made was that the 
students who read assignments regularly would ac- 
quire and retain basic content information toa 
greater extent than those who neglected to read 
assignments. In order to encourage daily reading 
of assignments, it was suggested that each instuctor 
in the department administer a number of tests at 
frequent intervals. These tests were to be very short 
in nature, designed to determine whether or not the 
student had read his assignment for theday. A total 
of twelve of these ** pop quizzes” were given in each 
class in addition to three regular hour long exami- 
nationsduring the course of the semester,and of 
Course, the final examination. 


Course Objectives The course in American National 
Governmentas it is taught at West Texas State 

College is a “ traditional” course in that it empha- 
sizes knowledge of organization and functions of 
American Government. The course is a general 

survey of the executive, legislative and judicial de- 
partments of our national government. It includes a 
survey of the United States Constitution, the concept 
federalism, and deals at some length with the func- 
tions of major departments and agencies of our 
nationalgovernment. The course attempts to convey 
as much of the ‘‘factual’’ knowledge about United 
States Goveramental organization and functions as 
can be adequately handled in one semester. Assign- 
ments consist largely of textbook reading assign- 
ments. The methods of teaching in thecourse are 
primarily lecture and class discussion. 


The Hypothesis There is no significant difference 
ие DYES er 
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inachievement on desired outcomes between classes 
which are given frequenttests in American National 
Government and classes which are tested only three 
times during the semester. 


Measuring Instruments 


Cooperative School 
and College Ability 


Тевї{й.. 1.4 ат. iae (Matching purposes) 

Pre-Test......... eet Cooperative Ameri- 
can Government 
Test Form Y 

Post Теве. 00.0. essnee Cooperative Ameri- 
can Government 
Test Form X 

Experimental Tests .........- Instructor made 


«рор quizzes” (ad- 
ministered only to 
experimental group) 


Hour Ехатв.%........“..... Three separatereg- 
ular hour -long in- 
Structor made essay 
exams. Adminis- 
tered periodically 
throughout course 
(administered to 
bothgroups. Equiv- 
alent forms.) 


Experimental Design Two classes were selected 


with a total of approximately 38 students for purposes 
of the experiment. Theclasses were divided into two 


groups, anexperimental and control group. The ex- 
perimental class wasgiven frequent testing inaddi- 
tion to the schedule of regular examinations, while 
the control class was given only three regular, 
hour-long examinations, Both classes were selected 
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CHART 1 

Student Hour Test Post Test Hour Test 

Pairs Pre-Test Average Post Test Differences Differences 
Cont. Exper. Dot сыр TRE UO nme E-C 
1 62 6l 81 86 67 65 5 -2 
2 59 60 96 93 64 63 ES EI 
3 58 60 92 93 51 62 1 П 
4 57 58 94 82 в 62 512 1 
5 54 58 85 84 49 62 -1 13 
6 54 57 88 94 60 62 6 2 
т 53 57 84 TI 57 52 ат -5 
8 52 54 96 88 58 56 -8 22 
9 52 52 82 87 52 58 5 6 
10 51 52 90 88 57 63 -2 6 
П 50 49 70 19 46 54 9 8 
12 48 47 90 19 58 46 E -12 
13 46 47 93 13 53 51 -20 22 
14 44 46 86 76 43 46 -10 3 
15 44 46 80 84 50 52 4 2 
16 44 46 84 87 51 50 3 -1 
17 43 45 81 76 а 48 -5 7 
18 42 45 т 83 49 53 12 4 
19 42 45 84 15 51 53 -9 2 
EX 955 985 1627 1584 1018 1058 ZX-43 ZX-40 
Mean 50.26 51.84 85.69 83.37 53.58 55.68 2726 250 
t 1.19 1.57 
ZX*-1335 ZX*-696 


OO. iol. lie dae Т Кыш сш и ше сш I E ccc с ——— 
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on a random basis at the time of registration. Both 
classes were taught by the same instructor. 

The control andexperimental groups were matched 
on the basis of the Cooperative School and College 
Ability Tests and their factual knowledge of the 
organization and functions of American Government 
as measured by the Cooperative Test in American 
Government. Students were matched оп the basis 
of these tests during the first week of the semester. 
(See column A, Chart I) 


Procedure The control group had a mean of 50. 26 
compared to a mean of 51.84 for the experimental 
group. The experimental group achieved a mean of 
52. 34 on the Cooperative School and College Ability 
Test, while the control group mean was 52.37 on 
this measure. The “7” technique was used to de- 
termine the significance of difference in means. The 
mean of the individual pair differences was calcu- 
lated and found to be 2.10 in favor of the experimental 
group. The standard error of the mean was calcu- 
lated at 1. 34. The resulting 47? value was 1. 57. 


Findings In the experiment it was evident that the 
hypothesis of no difference between the two methods 
had to be accepted. Means of 53.58 for the control 
group and 55. 68 for the experimental group indicated 
somewhat better results with the experimental group, 
but this difference could have been due to chance. 
Similarly, means of 83. 63 for the control group and 
83, 37 for the experimental group on the average of 
hour tests yielded a mean difference of 2. 26 and a 


“t? of 1.19 which was not statistically significant . 

Even though the two means are reversed there is 
not enough difference inthe groups on the hour-test 
average or the post test to warrantany definite con- 
clusions. Using both hour-test averages and post- 
test results as a basis of comparison of thetwo 
groups we could only conclude that the difference 
that exists between the two classes could easily be 
due to chance and not to the different methods em- 
ployed in teaching the classes. 

The students in the experimental section were 
asked their opinion on the “рор quizzes 'andthere 
was a near unanimity of opinion favorable to the 
technique. Comments suchas ‘‘the quizzes kept me 
on the ball'' and “І was forced to develop regular 
study habits" were common. Most of the students 
who took “рор quizzes" felt it helped them learn 
the basic information required in thecourse even 
though the results of the experiment indicate this 
was not true. The experimental class was also more 
responsive in the opinion of the instructor. The 
quizzes stimulated more discussion in the experi- 
mental class. There was a tendency, however, for 
the students in the experimental class to discuss 
minor points raised by questions on the “рор 
quizzes” and the result was tnat these students tended 
to become more interested in detail rather than 
problems of broader significance. 


The advisability of frequent testing of the sort 
here employed remains to be decided by further re- 
search. 


For Each of Our Subscribers....A, Very Special Holiday Present! 


Yes, at this holiday season we are particularly happy to announce that 
The Journal of Educational Research will publish a ‘‘gift”’ issue this year. 

As you may recall, the November JER contained an editorial on teach- 
ing machines and programmed learning. Init we suggested that many fine 
developments will stem from this new attempt to combine the science and 
art of teaching into action programs. . . . and much valuable time, money, 
and energy will be wasted on unproductive activities and unsupported 


statements. 
We already have reports of researches in hand that suggest a bright 


future for programmed instruction, as well as reports that indicate some 
of the claims made for teaching machines are not supported by data now 
available. So significant do these new findings appear that we are bring- 
ing them together in a special added June issue of The JER, to be sent to 
you FREE as a means of thanking you for the fine response you һауе made 
to our current development program. 

(Do you have an appropriate paper for this “bonus” issue? Additional 
manuscripts will be accepted up to April 1, 1962. ) 
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The Influence of the Time Factor on the Scores of the Triggs 


Diagnostic Reading Test as Reflected in the Performance of 


Secondary School Pupils Grouped According to Ability 


THE RELATIONSHIP between timed and un- 
timed scores in educationaltesting has been subject 
to much controversy. This paper will present a 
Study designed to determine the relationship of timed 
and untimed test scores as observed when vocabu- 
lary tests were given to secondary school students 
grouped according to ability. 

In an experimental testing program at Hor- 
nell High School, the Triggs Diagnostic Reading 
Test - Section 1: Vocabulary (Revised) was admin- 
istered to the 9th, 10th, 11th, and 12th grades 
under timed (speed) and untimed (power) conditions. 
Section 1: Vocabulary (Revised) measures r ea ding 
vocabulary in English grammar and literature, 
mathematics, science, and social studies. The four 
Separate scores permit reliable comparisons of word 
knowledge in these major content fields. 

In the grades tested there were: 196 fresh- 
men - 27 in scholarship group, 111 inregents group 
(average group), 58 in non-regents group (non-aca- 
demic group); 217 sophomores - 25 in scholarship 
group, 128 in regents groups, 64 in non-regents 
groups; 174 juniors - 21 in scholarship group, 118 
in regents groups, 35 in non-regents group; 135 
Seniors - 21 in scholarship group, 75 in regents 
£roups, 39 in non-regents groups. 


Problem 
Our intention was to determine: 


First - whether the speeding of the vocabu- 
lary tests makes the scores reflect some- 
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thing different from that which they indicate 
when the students are given unlimited time. 


Second - just how much difference isre- 
flected in each group of our three track sys- 
tem when more time is allowed. 


Third - if there is a marked difference be- 
tween the score s ona vocabulary test when it 
is given under timed and untimed conditions, 
are the scores a symptom of reading skills 
inadequate for understanding at the individ- 
ual's potential? 


Procedure 


After the purposes and requirements of the 
study were outlined, the chairman of the English 
Department met with the testing director anda pro- 
gram was set up for the entiretesting to be carried 
on in all the English classes of the 9th, 10th, 11th 
and 12th grades. The students were given the in- 
Structions for the timed test. So that they would do 
their best, students were not told the test would be 
returned to them for power testing. After the stip- 
ulated time, the tests were collected and a red pen- 
cil mark drawn under the last item finished in each 
of the vocabulary areas. The tests were scored, 
and the raw scores and the percentiles for the speed 
testing were tabulated (T score). The tests were 
then returned, and the students were instructed to 
attempt the items which they had been unable to 
reach because of lack of time, Next, the tests were 
Scored again. The score from the power testing 


DATA RESULTING FROM AN ANALYSIS OF TRIGGS DIAGNOSTIC READING 
TEST - SECTION 1: VOCABULARY (REVISED) GIVEN TO THE 9th, 10th, 11th 


TABLE I 


12th GRADES UNDER TIMED AND UNTIMED CONDITIONS 


Total Number Tested 


196 Freshmen 


Number Tested 


Number Completing Test 
within Time 


Number Needing Additional Time 
to Complete Test 


Number Needing Additional Time 
for Completion of: 


English 
Mathematics 
Science 
Social Studies 


Number Who Raised Overall Per- 
centile Scores with Additional 
Time 


Number Who Failed to Raise Over- 
all Percentile Scores when Given 
Additional Time 


217 
Sophomores 


174 Juniors 


135 Seniors 


В a 8 
с. с. 
= pue suos = 5 
In © В 2$ g ee beta leo 
alg | рда: я 3 £|5 
5р g = = 8 e ep g 
Ble 2 a 22129 Slelz 
27 111 58 25 128 | 64 21 118| 35 21 75 39 
16 19 4 14 36 |15 16 53| 14 14 29 6 
11 92 54 Lt 92 | 49 5 65 | 21 7 46 33 
11 72 52 7 90 | 45 5 50| 12 5 44 31 
7 72 52 8 71 | 38 3 37| 19 5 33 29 
4 62 46 6 42 | 37 0 35| 19 0 17 20 
3 59 35 3 44|31 3 38 12 0 12 16 
84 53 8 84 | 46 4 53| 21 5 42 32 
4 8 1 3 8 3 1 12 0 2 4 1 
= 9 
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was designated as the E score. 

The speed scores and the power scores were 
compared to find out whether the additionaltimeac- 
tually did benefit certain students. We had antici- 
pated that additional time would increase the scores 
of some students, The question we really wished to 
throw light on was whether the added time would 
serve to change the relative standing of many stu- 
dents and what percentage of change would occur 
in each track. We also wished to determine wheth- 
er marked difference in speed and power scores 
suggests that a student’s reading skills are inade- 
quate for working at full potential. 

All the freshmen students had been screened 
for academic intelligence to their tracks beginning 
in the kindergarten with the SRA Primary Mental 
Abilities - (5-7); in the 3rd grade with Pintner Du- 
rost - Reading and Picture batteries; in the 6th 
grade with Lorge Thorndike - verbal and non- 
verbal batteries; in the 8th grade with SRA Primary 
Mental Abilities, 11-17. Inthe reading area they 
had been screened at these same levels. Inthe 
kindergarten with Diagnostic Reading Test-Survey 
Section: K-4 Reading Readiness Booklet; at the 3rd 
grade level with Diagnostic Reading Test - K-4, 
Booklet 3; at the 6th grade level with Diagnostic 
Reading Test - Grades 4-8, Booklet 1 andthe Iowa 
Silent Reading Elementary Tests; at the 8th grade 
level with Diagnostic Reading Test - Section 1: Vo- 
cabulary - Grade " through College Freshman Year. 
Also at the 8th grade level aptitude tests had been 
given to facilitate more accurate 9th grade place- 
ment, The aptitude tests administered were Cali- 
fornia Algebra Aptitude Test, Language Prognosis 
Test, and the New York State Junior High School 
Survey Test in General Science. 

These objective scores along with the teach- 
ers' subjective ratings of achievements of each stu- 
dent were submitted for evaluation at each of these 
grade levels. We continued to be careful that stu- 
dents were placed in their ability track at the end 
of the Sophomore, junior, and senior years by hav- 


ing guidance committees assigned to each class for 
Such purpose. 


Table I presents data on the analysis made 
from the test results. 


Findings 
Analysis of data in Table I indicates: 


First - That some high school students with- 
in each group made marked changes in their rel- 
ative standings when they were given additional 
time during which to work on the test. Therefore, 
Ерге and power scores cannotbe used interchange- 
ably. 

Second - that changes in relative scores oc- 
curred with considerably greater frequency in the 
regents and non-regents groups than inthe scholar- 
Ship group. A critical analysis ofthe data indicates 


that, although the frequency of change in the power 
Scores in the non-regents group is large, the dif- 
ference in the scores is not great enough to believe 
that additional time would benefitthese non-regents 
Students significantly. Тһе striking reflection of 
change is in the standings of the regents group. 

Third - that there is a relationship in the 
results of these scores to the general academic a- 
bility of the student. 


Summary 


Under the power conditions, the average 
student, that is the slow and accurate student, comes 
out nearer the top when given plenty of time but suf- 
fers when there is a time limit. Changes in relative 
Standings occur with considerably greater frequen- 
cy through the middle range of scores than they do 
at either extreme of the range of scores. 
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Attitude Toward Mathematics and the Predictive 
Validity of a Measure of Quantitative Aptitude 


MANY PERSONS concerned with mathematics 
learning believe that ‘‘nonintellectual’’ factors such 
as attitude and emotional make-up have an impor- 
tant bearing upon a student's success with this sub- 
ject. Barakat's (1) factorial analysis of mathemat- 
icalability suggests that such factors as **emotional 
irritability’’ and lack of industry interfere with suc- 
cess in mathematical tasks. Bendig and Hughes (2), 
investigating achievement in introductory statistics, 
attributed about four percent of the variability in a- 
chievement to differences in students' attitudes. 
Rochlin's (5) study using projective techniques 
pointed up various personality factors associa ted 
with under- and over-achieving students. 

In the present study, a revaluation was made of 
one of the most widely used measures of quantitative 
aptitude through an investigation of differences in 
the predictive validity of A. C. E. Psychological Ex- 
amination Quantitative (Q) scores for students 
grouped on the basis of their attitude toward math- 
ematics. The hypothesis investigated was that high- 
er correlations for the predictor variable would be 
found with subjects having more neutral attitudes 
toward mathematics than with those having the least 
neutral attitudes. 


Procedure 


The subjects of this investigation were 264 col- 
lege sophomores attending the full-time day session 
of a metropolitan, non-resident college. All were 
white males and were comprised of three groups. 
The first group contained 93 students enrolled in the 
Business Administration curriculum, each having 
completed a one-year course in Business Mathemat- 
ics. The second group contained 72 students en- 
rolled in the Social Science program, all of whom 
had completed a one-year course in Introductory 
College Mathematics. Thefinalgroup contained 99 
subjects enrolled in the NaturalScience curriculum. 
Each student in this group had completed a course 
of Analytic Geometry and Calculus. Each group had 
a different instructor who taught all of the subjects 
in his respective group. 
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A standardized attitude scale was used to obtain 
an index of student attitudetoward the subject of 
mathematics. This scale was group administered 
by non-instructional personnel in order to minimize 
the restraints which might prevail in a classroom 
atmosphere. The subjects within each curriculum 
were ranked from highest to lowest on the basis of 
their attitude scale score. They were then divided 
into equal thirds. This provided three sub-groups 
within each curriculum which were variously termed 
High, Middle, and Low, the High group containing 
subjects with the most positive attitude scores, and 
the Low group containing those with the least posi- 
tive attitude scores. 

All subjects had also taken the A.C.E. Psycho- 
logical Examination, 1952 edition. Their Quantita- 
tive scores on this testservedas the predictor var- 
iable. The subjects’ mathematics grades were used 
as the criterion measure of mathematics achieve- 
ment. Separate Pearsoniancorrelations were com- 
puted for the High, Middle, and Low attitude sub- 
groups of each curriculum, as well as the standard 
deviations of both variables. The results are pre- 
sented in Tables I, II, and Ш. 


Results and Discussion 


An inspection of Tables I, П, and Ш r evealsa 
consistent pattern of higher correlations for the 
Middle, or most neutralattitude sub-groups. These 
correlations were .66 for the Natural Science, . 49 
for the Social Science, and. 50for the Business Ad- 
ministration Middle sub-groups. One may note 
from the tables that these Middle groups' correla- 
tions were the only ones which proved to be signifi- 
cant. The least indication ofa positive relationship 
between Q-scores and mathematics grades was 
found with the Low, or most negative attitude sub- 
groups, particularly for the SocialScience Curric- 
ulum. 

Guilford (1950) has pointed out that the size of à 
correlation is markedly influenced by the variabili- 
ty of measured values in a given population. The 
more variable the population's values, the higher 


* This report is part of a larger study done under the direction of Dr. Francis J. Crowley and submitted as 2 
doctoral dissertation at Fordham University, Graduate School of Education. 


CRISTANTIELLO 


TABLE I 


CORRELATION BETWEEN Q-SCORES AND MATHEMA TICS GRADES-- 
NATURAL SCIENCES 
к————————————————————————————————— 


Attitude Group г SD, Q-Scores SD, Grades Level of Significance М 


High .33 9. 15 10.22 £ 33 
Middle - 66 9, 42 10. 22 .01 33 
Low . 03 9.36 12.02 т 33 


* Very close to . 05 level. 


TABLE П 


CORRELATION BETWEEN Q-SCORES AND MATHEMATICS GRADES-- 
SOCIAL SCIENCE 


Attitude Group г SD, Q-Scores SD, Grades Level of Significance М 


High .30 9. 99 9. 68 = 24 

Middle . 49 8.76 9.80 ‚05 24 

Low egi 8.31 10. 64 E 24 
TABLE III 


CORRELATION BETWEEN Q-SCORES AND MATHEMATICS GRADES 
BUSINESS ADMINISTRA TION 


Attitude Group г 8р, Q-Scores SD, Grades Levelof Significance М 


High .26 7.38 8. 06 > 31 


Middle .50 9.95 10. 75 
Low .25 10.33 9. 38 = 
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one may expect the correlation, other things being 
equal. By comparing the obtained SD's of each cur- 
riculum's sub-group on both variables, one notes 
their similarity. To check further this point, the 
standard error of the difference between the SD's 
was calculated and none of the differences for either 
variable proved significant. 

Peters and Van Voorhis (4) have also stated that 
for two correlations to be comparable, they must 
have been computed from populations of the same 
degree of heterogeneity. Since this is the case in 
this study, the Q-scores seem to have served as 
better predictors of mathematics grades for the 
Middle attitude sub-groups than for either the High 
or Low attitude groups. In order to test this point, 
the correlations were converted to Fisher's Z-func- 
tion and the standard error ofthe difference between 
two Z's was determined. The significantdifferences 
which were found were between the Middle and Low 
NaturalScience sub-groups. This difference was 
significant at the . 01 level. The other was between 
the Middle and Low SocialSciencesub-groups. This 
difference was significant at the . 04 level. It thus 
appears that for the subjects inthese two curricula, 
the A. C. E. Q-score had a greater validity іп pre- 
dicting mathematics achievement for students with 
the most neutral attitudes toward thesubject, and a 
significantly lesser validity for the group with the 
relatively least favorable attitudes toward mathe- 
matics. 


Summary and Conclusions 


The correlation of A. C. E. Q-scores with math- 
ematics grades was investigated for 264 college 


sophomores divided into sub-groups on the basis of 
their attitude toward mathematics. Itwas foundthat 
there was a consistently higher positive correlation 
for Middle, i.e., the most neutralattitude sub- 
groups throughout various curricula. Several of 
these correlations proved to be significantly differ- 
ent. These were between the Middle and Low atti- 
tude sub-groups in the Natural science and Social 
Science curricula. It would seem that the use of the 
А.С.Е. Q-score as a predictor of success іп math- 
ematics should be re-evaluated. It is likely that an 
instrument such as this may be moreefficiently 
employed when the prevailing attitudes of a given 
population toward a particular subject are taken in- 
to account. 
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Personality Factors Related to College Success 


PSYCHOLOGISTS and couns elors working ina 
college setting have long been interested in identi- 
fying the factors which operate to determine the 
level of performance of collegestudents. This соп- 
tinued concern has resulted from the obvious fact 
that many students with intelligence considered suf- 
ficient for academic success have failed or have a- 
chieved at a level below that commensurate with 
their ability. Тһе interest of these investigators 
has centered on a large number of factors, ranging 
Írom the family background of students totheir pro- 
ficiency in the skills considered necessary for col- 
lege success. 

Recently, however, moreand moreattention has 
been directed to the effect onachievement of select- 
ed personality variables. This new emphasis has 
stemmed from the now generally accepted belief 
that the academic performance of the student is an- 
other aspect of his total behavior, and, as such, is 
determined and influenced by the dynamic processes 
of personality. 


Problem 


_The purpose of this investigation was to deter- 
mine the differences between the highest (H) and 
lowest (L) ranking students inthe day session of the 
School of Education of a large urban university with 
Tespect to their personality andadjustmentas indi- 
cated by scores on the following two tests: 

(1) On the Minnesota Multiphasic Personality In- 
ventory (MMPI): hypochondriasis (Hs), psychopathic 
deviate (Pd), interest (Mf), paranoia (Ра), psychas- 
thenia (Pt), schizophrenia (Sc),andhypomania (Ma), 
depression (D), and hysteria (Hy). 

(2) On the College Inventory of Academic Adjust 
ment (CIAA) (Borrow, Stanford University Press, 
1949): the curricular adjustment score (satisfaction 
With college routine and the chosen curriculum); the 
maturity of goals and level of aspiration score (ed- 
“cational and life goals, the desire and effort to a- 
ев them, and the sense of values); the person- 

efficiency score (effectiveness in scheduling and 
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carrying out daily activities); the study skills and 
practices score (study behavior at the desk and in 
the classroom); the mental health score (emotional 
adjustment, the extent to which the student believes 
himself beset by worries and emotional upsets); the 
personal relations score (ability to get along with 
instructors and fellow students); and the composite 
inventory score (the total of the six scores of the C 
TAA). 


Subjects 


The subjects participating in this investigation 
were 64 full-time students enrolled during the 1955 
-56 school year. - Thirty-two subjects, eight from 
each class, comprised the group of lowest ranking 
students. The H and L students were chosen on the 
basis of the first semester grade-point averages. 
The subjects were chosen randomly from the ten 
highest ranking and the ten lowest ranking students 
in each class. 


Method " 


Each subject completed the MMPI and the CIAA. 
The test scores of the H and L students were tested 
for significance by the analysis of variance tech- 
nique and the “4” test of significance. Тһе .01 lev- 
el of significance was employed. 


Results 


Analysis of the scores on the MMPI revealed 
that lowest ranking students in each class, in gen- 
eral, scored higher mean scores (indicating poorer 
adjustment) than did the corresponding highest 
ranking students on nine scales of the MMPI. The 
differences which were found to be significant were 
those between the H and L juniors on the following 
five scales: Hs, D, Hy, Pt, and Sc. Inaddition, the 
combined lowest ranking group was found to have 
significantly higher mean scores than did the total 


This stud: i tensive investigation of selected characteristics of highest and 
y constituted one part of a more ex i t hii 

lowest ranking college studente which was completed under the mentorship of Dr. James J. Cribbin as part 

9f the requirements for the Ph. D. degree in the division of Educational Psychology, Measurements and Guid- 


ance at the School of Education at Fordham University. 
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highest ranking group on the following four scales: 
Hs, Hy, Pt, and Sc. 

When the H and L subjects in this investigation 
were compared with reference to the number of high 
scores (а T score of 70 or more) onthe nine scales, 
it was found that in every class, the L students had 
a greater number of high scores onall of the scales 
than did the H students. In the freshmen class, the 
lowest ranking subjects had nine high scores as 
compared with three for the highest ranking fresh- 
men. In the sophomore class, the lowest ranking 
subjects received 12 high scores, while the highest 
ranking subjects received only one high score. The 
largest difference between the highest andlowest 
groups was found in the junior class. The highest 
ranking juniors had only five high scores, while the 
lowest ranking juniors had 24 high scores. In the 
senior class, the highest ranking students had no 
high scores on any of the scales, and lowest ranking 
students had only one high score. The combined 
highest ranking students had nine high scores on the 
nine scales combined, while the combined lowest 
ranking subjects had 46 high scores. Over half of 
the high scores of the combined lowest ranking 
group were made by the lowest ranking juniors. 

Analysis of the scores on the CIAA revealed that 
the highest ranking students in each class received, 
in general, higher mean sc ores (indicating better 
adjustment) than did the corresponding lowest rank- 
ing students on all the subtests of the inventory. The 
mean differences in the freshmen class were not 
significant. In the sophomore class, the meancom- 
posite inventory score and the mean personal effi- 
ciency score were significantly higher than those of 
the lowest ranking students. 

The highest ranking juniors were found to be sig- 
nificantly superior to the lowest ranking juniors on 
the following scores: the composite inventory score, 
the curricular adjustment score, the maturityof 
goals and level of aspirationscore, the personal ef- 
ficiency score, the study skills and practices 
Scores, and the mental health score. 

In the senior class, significant mean differences 
favoring the highest ranking seniors were found with 
regard to the following scores: the composite inven- 
tory score, the maturity of goals and level of aspi- 
ration score, the personalefficiency score, and the 
mental health score. 


The highest ranking students as a group were 
found to be significantly superior to the lowest rank- 
ing students as a group on the following scores: the 
composite inventory score, the curricular adjust- 
ment score, the maturity of goals and level of as- 
piration score, the personal efficiency score, the 
study skills and practices Score, and the mental 
health score. 


Conclusion 


The results of the present study indicate that with 
respect to the subjects and the school under inves- 
tigation, a definite relation existed between level of 
achievement and selected adjustment factors. 
Scores on the MMPI and the CIAA indicated, in gen- 
eral, that the highest ranking students tended to be 
better adjusted than were lowest ranking students. 
In addition, certain of these factors significantly 
differentiated betweenthe highestand lowest ranking 
students. These results provide further evidence to 
support the belief that personality factors are relat- 
ed to the level of achievement of college students. 

It must be remembered, however, that the rela- 
tionship existing between emotional adjustment and 
level of achievement is probably not a direct one. 
Although the present study provides no evidence in 
this regard, evidence has beenfound in the research 
on this problem to indicate that em o tional adjust- 
ment affects achievement in an indirect way. Un- 
stable and maladjusted students have been found to 
do less well in their studies in proportion to their 
intelligence than have students who were well-bal- 
anced. Research has also shown that a closer cor- 
relation exists between capacity and achievement in 
stable groups than in unstable groups. It might be 
concluded, therefore, from this evidence that the 
emotional adjustment of the student affects his leva 
of achievement by affecting the degree to whichuse 
is made of his potential. 

These results would imply, therefore, that if 
colleges and universities are to ensure the optimal 
development of students, they cannot ignore the in- 
fluence of personality and emotional factors upon 
achievement, Such institutions s hould screen stu- 
dents for personality and a dj ustment problems and 
provide personnel and facilities for the effective 
treatment of students with such problems. 


Field News 


SUPPORT FOR THE COOPERATIVE RESEARCH 
The House-Senate Conference Committee has agreed on a final fig- 


ure of five million dollars for the Cooperative Research Program for 
fiscal year 1962. 


PROGRAM 


NORTH CENTRAL ASSOCIATION EXTERNAL 

TESTING STUDY The Committee on Articulation of High Schools and Colleges of the 
N.C. A. hasissueda second report on the study of testing by external 
agencies and institutions. Some of the current criticisms and prob- 
lems of externaltesting as expressed by secondary school people are: 


1. 


Alarm that external tests will exert an undue influence on 
the secondary school, especially on the curriculum and the 
instructional program. 


. Invalid comparisons of schools because ofthe publicity given 


to scholarship winners and semifinalists. 


. Undue pressure to have their students participate іп the ex- 


ternaltest programs for duplication of testing by various 


external testing agencies. 
Heavy demands on school administrators and counselors 


because of the time required by external testing. 

Increased costs to the students. 

Confusion and guessing about coaching for tests. 
Commercial promotion of testing. Ў 

Failure of colleges in general to consult the high schools 
when.making plans for external testing. 

The purpose of externaltesting for graduates of N.C, A. ac- 
credited high schools who desire admission to any N.C. A. 
college or university. 


RESEARCH AT STATE UNIVERSITY COLLEGE OF 
1. A longitudinal four-year study of authoritarianism among four 


hundred subjects is being carried out. These subjects entered 
college in 1956. The purposeofthestudy is to determine the sta- 
bility of authoritarianism and its relationship to withdrawal or sep- 
aration from college, size of home town, academic frustration, 
participation in extra-curricular activities, and membership in a 
sorority or fraternity. 

2. A study is being made of changes in social distance over a four- 
year period toward peoples of thirty foreign countries among four 
hundred students entering college in 1956. The major hypothesis 
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being tested is: Subjects remaining in college will have shown а. 
significantly greater decrease in Social distance scores as meas- 
ured by the Bogardus Social Distance Scale than those subjects not 
remaining in college. 

3. A study is being made of attitudes of students toward the claim of 
some social scientists that schools and colleges are biased in fa- 
vor of the middle-class student. Hollingshead’s Index of Social 
Position is the instrument being used. 

4. A study of American misconceptions about India and Indians is 
underway. 

5. A study is underway to determine what changes, if any, have oc- 
curred in American middle-class ideology over a period of thirty 
years. A questionnaire has been made up of selected beliefs re- 
ported in Chapter 12 of Middle Town in Transition by Robert 5. 
and Helen M. Lynd. 


HAROLD FIELDS TO JOIN McCANN 


ASSOCIATES 


McCann Associates, specialists in selection of personnel in industry, 
civil service and the professions, has announced the addition to its 
staff of Harold Fields as Senior Associate in Education. He will di- 
rect a newly created divisionof personnel selection in School systems. 


Forthe past twenty-one years, Mr. Fields has served as a member 
of the New York City Board of Education, Inthat capacity he has de- 
veloped tests and initiated examinations for members ofthe teaching 
and supervisory staffs. Hehas also served as educational consultant 
to the Commonwealth of Puerto Rico in the selection of personnel and 
has lectured extensively on the subject in the United States and 
abroad. 


RESEARCH AT UTAH STATE 
UNIVERSITY 


Dr. Walter R. Borg, Chairmanofthe Bureau of Educational Research, 
reports several studies currently underway or recently completed. 
These include: 

1. **A Study of Optional Summer School Programs in Secondary 
Schools of Utah." The primary objectives of this study are 
to determine the present status of summer school programs, 
the need for expansion ofthe present programs, how sucha 
program might be organized and financed, the role of var- 
ious governmental agencies in supporting such programs, 
and what gains can be expected from summer school for 
gifted children and children in need of remedial work. 

2. “Ап Evaluation of Ability Grouping.’’ This is a four-year 
study in which subjects at elementary, junior high, and high- 
school levels in ability grouping systems are compared with 
subjects in a system employing random grouping. Groups 
of comparable ability level are compared on such variables 
as achievementgains, sociometric scores, personality var- 
iables, self-concept measures, andattitudes toward school. 

3. “Тһе Teaching Machine as a Study Aid." This study com- 
pares the achievement gains made by college students who 
studied two hours a week using teaching machines with a 
comparable groupof students who studiedtwo hours per week 
without the use of the machines. Comparisons also were 
made on the basis of intelligence and study habits. Results 


FIELD NEWS 


Showed a significantly greater gain made by studentsusing 
the machines as a study aid. Students with poor study hab- 
its gained as much when using the machine as students with 
good study habits who did not use the machine. This and 
other findings suggest the use of the machine compensates 
to some degree for differences in study skills. Students in 
the lower fifty percent of the group using the machines on 
the basis of verbal intelligence made achievement gains com- 
parable to students in the upper fifty percent of students not 
using the machines. 


RALPH C. M. FLYNT TO SERVE AS ASSISTANT 


COMMISSIONER 


Commissioner McMurrin has asked Dr. Ralph C. M. Flynt to serve 
as Assistant Commissioner for Research inthe U. S. Office of Educa- 
tion. Dr. Flynt has worked with the American Library Association. 
He was one of the leaders who helped initiate the Cooperative Re- 
search Program originally when Dr. Samuel Brownell served as 
Commissioner of Education. 


RESEARCH SYMPOSIUM 


SERIES 


Under sponsorship of Phi Delta Kappa, the University of Oregon, and 
Chi Campus Chapter, thethirdofa series of symposia devoted to ed- 
ucational research was held July 10-11, 1961, onthe University of 
Oregon campus. The symposium was chaired by Dr. Keith Gold- 
hammer, Director of the Bureau of Educational Research at the Uni- 
versity of Oregon, and Dr. Charles R. Foster, President of Phi 
Delta Kappa. The publication resulting fromthis symposium will be 
available early in 1962. Copies of the publication which include the 
papers and discussions of the 1959 symposium at the University of 
Virginia and the 1960 symposium at the University of Minnesota may 
still be purchased through the International Office of Phi Delta Kappa 
in Bloomington, Indiana. Тһе Virginia Sy m posium was devoted to 
the rationale of educational research today and attempted to assess 
what we are doing in educational research as a basis for establishing 
future goals and emphases. The Minnesota Symposium dealt with re- 
search designin analysis and presented the ideas of leading authori- 
ties from various intellectual discipline. 


Readers are invited to send items of research interest to 
John Schmid, Professor, College of Education, University 
of Arkansas, Fayetteville, Arkansas. 
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Т Among 
È| Those 
Present 


REVA WHITE ALLMAN is professor of education at 
Alabama State College. Her BA is from Miles Col- 
lege, her MA from Fisk, and her PhD from the Un- 
iversity of Michigan in 1951. She has taught in the 
Birmingham public schools, at Alabama A and M 
College, and at Miles. Dr. Allman was recently 
cited for outstanding work in higher education by the 
Pi Lambda Sigma Honor Society of Payne College. 
Her memberships include NEA, AAUP, NSSE, and 
Alpha Kappa Alpha. Other writings include **A Study 
of General Education with Emphasis on Teacher Ed- 
ucation in Negro Schools,’’ “Ап Evaluation of the 
Goals of Higher Education by 294 College Seniors of 
Alabama,’’ and “А Study of the Social Attitudes of 
College Students.” Mrs. Allman’s husband is а 
chemistry teacher. They have one daughter. 


VERNON TROXELL is a graduate of Illinois State 
Normal University and the University of Illinois. 
His undergraduate major was mathematics and his 
graduate emphasis was elementary school instruc- 
tion. Asanassistant professor in the School of Ed- 
ucation at the University of Kansas, Dr. Troxell 
teaches elementary methods courses and directs the 
reading and study skills clinic. He formerly taught 
at the University of Illinois and in the Forest Park, 
Illinois public schools. His memberships include 
Phi Delta Kappa, Kappa Delta Pi, Kappa Mu Ep- 
silon, KSTA, NEA, NSSE, NCTM, NCTE, and IRA. 


MRS. ZELL SoRELLE is a native of the Texas pan- 
handle. She received her BS and MA degrees from 
West Texas State College and has completed her 
course work toward a PhD at the University of 
Denver with a major emphasis in the field of oral 
communication. She has also done some work at 
Highlands University. Mrs. SoRelle has been inter- 
ested in television as it concernsthe habits and ac- 
tivities of school children. She is the mother of а 
seventeen-year-old daughter. JACK WALKER is 
professor of speech at West Texas State College 
where he earned his BS and MA degrees. His PhD 
is from the University of Denver. He has also stud- 
ied at the University of Texas and UCLA. 


HELEN (Mrs. Redmond) ROCHE is a psychologist 
in the Van Dyke PublicSchoolsof Warren, Michigan. 
She also teaches courses in clinical psychology at 
Wayne State University and at the University of De- 
troit. For the past ten summers, she has directed 
à reading workshop at Detroit. Previous articles 
include “Тһе Van Dyke Level System, ” ‘‘Help for 
the Retarded Reader,” and “Environmental Con- 
trasts.’’ 


E. A. ENSTROM received his BA from Waynesburg 
College, his MA from Penn State, and his PhD from 
the University of Pittsburgh. He has spent the past 
26 years specializing in solving alltypes of problems 
in the teaching of handwriting. In addition to writing 
and research, Dr. Enstrom has given over 150 lec- 
tures at some 35 colleges and universities ina five- 
state area inrecent years. Heis presently research 
consultant with the Peterson System of Directed 
Handwriting, Greensburg, Pennsylvania. 


Roche Enstrom 


THEODORE L. HARRIS, guest ed- 
itor for this issue of The JER, is 
professor of education at ће Univer- 
sity of Wisconsin. A Chicago PhD, 
he is a specialist in developmental 
and remedial reading, director of the 
summer reading clinic at Madison, 
co-author of The Learning Process, 
collaborator with Coronet Films, 
and contributor to the Encyclopedia 
of Educational Research. 


READING TESTS HAVE THEIR PITFALLS, Troxell reports: 1) the purpose for 
reading influences the speed with which material is read, 2) speed and com- 
prehension scores on a general reading test are not necessarily good pre- 
dictors of speed and accuracy for other kinds of material read for other pur- 
poses, 3) ability to read expository mathematical material successfully is by 
no means an accurate indicator of success in arithmetic. 


LEFT-HANDED WRITERS ARE MORE COMMON THANISSOMETIMES SUPPOSED. 
In a survey of 92, 656 pupils in grades one through six in selected states, 
Enstrom found 11. 1 percent ‘‘southpaws’’ (12. 5 percent of the boys and 9. 7 
percent of the girls). The number did not decrease with advancing grades 


and age. 


CHILDREN NOT READY FOR READING profit more by being kept in a readiness 
situation until prepared for formal reading than by being placed in the first 
grade, failing, and having to repeat the work. Furthermore, moving chil- 
dren ahead into a formal reading program only after they succeed on the 
junior primary level does not delay their school progress. So reports Roche 
after ten years of action research in Michigan schools. 


REGULAR VOCABULARY DRILL paid significant dividends for juniors and seniors 
in Alabama State College, Allman says. Even во they fell short of the national 
freshman norm. Lowest scores were made in social studies words. 


JUNIOR HIGH STUDENTS SPEND MORE TIME WATCHING TELEVISION than they 
spend in school. At least that was the case when SoRelle and Walker studied 
the habits of pupils in Amarillo, Texas. Some 3500 students averaged 1216 
hours a year watchingtelevision and 1144 hours a year in classrooms. Three 
years later, as high school students, the same youths spent an average of 
923 hours a year watching TV. 


THE JOURNAL OF EDUCATIONAL RESEARCH 
(Volume 55, Number 5, February 1962) 


Summary of Investigations Relating to Reading 


July 1, 1960 to June 30, 1961 


THEODORE L. HARRIS 
University of Wisconsin 


Dr. Harris, a graduate of the University of Chicago, a former student of 
Dr. William S. Gray, and a well-known specialist in the field of reading has 
been asked to assume responsibility for The JER's annual summary of re- 


search relating to reading. —A. S. Barr 


IN THE many years in whichthe annual Summary 
of Investigations Relating to Reading has appeared, 
the flow of research on the sociological, psycholog- 
ical, physiological, and pedagogical aspects of 
reading has continued at anunchecked pace. So, too, 
has the number of bibliographies upon selected top- 
ics in reading and of summaries and reviews of re- 
Search related to reading. * 

Each of these serves a valuable purpose. The 
Summary, however, has for years served a unique 
function because of the scope of the research re- 
Viewed annually. The original intent of theSum- 
mary will perhaps continue to be best served by 
limiting the investigations cited and summarized to 
empirical research studies. Primary sources will 
be emphasized insofar as possible. It likewise ap- 
pears most appropriate to note the chief findings 
and nature of the experimental designs used in rela- 
tion to the research findings. Inthese ways it is 
hoped that the Summary will continue to serve a 
useful function for those interested inthe full spec- 
trum of empirical research related to reading, and 
in the progress of such researchfrom year to year. 


The Sociology of Reading 4 
1. Cultural Influences on Reading 


Effect of Cultural Background. Smilanski (131) 
reportedthat the reading and arithmetic achievement 
9f Israeli primary children reflected to а consider- 
able degree the effectof cultural standards in coun- 
triesfrom which their parents came. Children of 


* See, for example, the rather regular appearance of bibli 
Elementary English, Journal of Developmental Reading, 


parents from culturally underdeveloped countries 
achieved less well and showed more severe adjust- 
ment problems in reacting to school success and fail- 
ure than did children of parents from culturally ad- 
vanced countries. The comparative analysis was 
based upon 446 children in grades K-2. 

Gaier and Collier (51), in a comparison of story 
preferences of 199 American and 120 Finnish chil- 
dren with average CA’s of 10.0 and 11. 1 respective- 
ly, concluded that sex differences in reading prefer- 
ences within and between cultures were apparently 
much more important than cultural differences. In 
both cultures, however, a marked preference for 
fiction was found. Detailed data on reading prefer- 
ences may be found in the study. 

The relative performance of sixth-grade pu- 
pils of Anglo and of Spanish-American and Indian 
ethnic groups upon an idioms test was investigated 
by Yandelland Zintz (148). Тһе minority groups 
differed significantly from each other and from the 
Anglo control groups, to the disadvantage of the 
former, thus lending additional support tothe notion 
that minority groups in the American culture may 
experience considerable difficulty in understanding 
American idiomatic expression. Onthe other hand, 
Snider (136) showed that comparative performance 
acculurated Nez Perce Indian and white senior high- 
School students upon the Cooperative Test of Read- 
ing Comprehension was not significantly different, 
thus suggesting that factors of age and degree of ac- 
culturation may be important in such comparisons. 

Effect of TV Upon Reading. Schramm, Lyle, and 


ographies and reviews of research inreading іп 
The Reading Teacher, Review of Educational Re- 


Search, and the Proceedings of the International Reading Association an 


id National Reading Conference. 
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Parker (125) in a series of studies directed to the 
effect of television u pon children in ten American 
and Canadian communities, notedthe impact of tele- 
viewing upon children's vocabularies and upon read- 
ing habits and attitudes. Their observations аге 
similar to those reported previously by Witty and 
others. The relative impact of TV and other mass 
media were also shown upon the way in which a ma- 
jor news story is diffused by Deutschmann and Dan- 
ielson (33) and upon teen-ager's image of the press 
by Davis and Watkins (31). 

Leisure Activities and Scholarship. Hemmerling 
and Hurst (71) surveyed 202 high-school sophomores 
concerning nine leisure time activities, including 
reading. Leisure readingtime, reportedly one-half 
hour per day, was not significantly related to grade- 
point average, although amount of time spent in 
studying was highly significant. Detrimental effects 
for amount of time spent in outside work, listening 
to music, and going to the movies were also report- 
ed, 


2. Characteristics of Reading Materials 


That favorite item of Sunday reading, the Sunday 
comic strip, was analyzed by Barcus (10) for the- 
matic patterns and trends, major types, and uses of 
characters in 778 comic strips appearing from 1900 
to 1959 in three Boston newspapers. The most 
marked changes in types, themes, and continuity 
tended to take place inthe 30's, leading to more di- 
versification in current comic strips. Greater re- 
cent emphasis upon action, adventure and other 
topics and less on humor, an increase in sustained 
rather than episodic continuity, and a rise in the 
proportion of crime and detective as well as love and 
romance themes was noted. The author comments 
that the comic strip has asits major role the relief 
of daily tensions and tends to “reflect some of the 
underlying needs, fears, desires and values of the 
culture.” Further study of the latter factors as re- 
vealed in comic strip characters with which the 
reader may identify was reported underway. 


3. Reading Habits 


Magazine Reading. Binney (14) showed that when 
college freshmen were asked to recommend maga- 
zine articles that would be good for other freshmen 
to read, they overwhelmingly chose articles that had 
an informative or useful function rather than liter- 
ature or the fine arts. The author concludes that 
his students read for duty ratherthan pleasure, al- 
though the nature and context of his request may 
have influenced his results. Balow (9) found little 
or no evidence of reading of professional magazines 
among 268 undergraduate and graduate students in 
a midwestern college of education, except for pro- 
fessional magazines or organizations to which teach- 
ers belonged. 


Reading by the Aged. A bleak picture isdrawn 
of the book reading habits of an aging population in 
a rural southern town by Hoar (77). In a carefully 
drawn sample of 200 adults over 60 years of age, it 
was found that 60% spent no time in reading books 
and that over half of the population had not read a 
book in six months. There was some evidence that 
amount of education was a greater determinant of 
book reading than age, and that women read more 
books than men. 


4. Psycho-Social Effects of Reading 


Reading Attitudes and Opinions. Ina complex 
study designed to measure changes in cognitive con- 
sistency, McGuire (99) examined the effects of per- 
suasive syllogistic arguments upon the reader's log- 
ical consistency for central and relatedissues. One 
hundred and twenty college students were given sets 
of syllogistically related propositions to rate ac- 
cording to probability and desirability. A week later 
persuasive messages were introduced and the stu- 
dents retested. Afinalretest was given in the third 
week. Тһе author found а) a significant amount of 
inconsistency due to wishfulthinking in the first ses- 
sion; b) adecline in wishfulthinking and a significant 
changetoward consistency on the central and relat- 
ed issues after the persuasive messages were read; 
and c) increased consistency on related issues one 
weeklater. McGuire (100) used a similar technique 
to study other effects of persuasive messages іп а 
second study involving 35 college freshmen, The 
prediction that persuasion arguing in an inconsis- 
tency-reducing direction on central issues would 
lead to significant change was confirmed, but its 
greater permanence over change on related issues 
was not confirmed. Predicted significant change on 
related issues as a result of inconsistency-increas- 
ing persuasive messages was also confirmed. The 
temporal rate of decay for both ty pes of changes 
was found to be about the same. 

Manis (104) secured e vidence that various de- 
grees of commitment (highly committed, moderately 
committed, or neutral) by 70 psychology students 
toward fraternities significantly influenced their in- 
terpretation of anti-fraternity statements but not of 
those which were pro-fraternity or neutral, Adams 
(3) prepared three news stories representing three 
different positions on the issue of nationalism VS. 
internationalism. These were presented to 152 un- 
dergraduates totestthe effect of knowledge included 
about the spokesman's status and nationality upon 


student opinions. Рге- and post-tests showed that 
knowledge of nationality was highly significant, but 
that of status was пої. The author noted that “еуегу 
time an American advocated nationalism in conflict 
with a Britisher advocating internationalism, M 
subjects changed in the direction advocated by th 
American. ” The reverse proposition was also true. 


5. Newspaper Reading Habits and Effects, Brin- 
tonand McKown (20) sent questionnaires to 189 news- 
paper subscribers and 102 non-subscribers to study 
the effect of a substantial amount of newspaper in- 
formation about a current health issue, fluoridation, 
inthe adjoining city. Newspaper subscribers were 
found to be much more aware ofthe issue than non- 
subscribers, being likewise significantly better in- 
formed and more activein discussion. То a lesser 
degree the subscribers were more definite in want- 
ing to bring the issue to a vote and in knowing how 
they would vote. Radio and TV were found to be 
relatively minor sources of information. This study 
is of considerable interest because of the selection 
of an issue in which the questionnaire respondents 
were not directly involved and because of the com- 
parative analysis of data for subscribers and non- 
subscribers. 

s Davis and Watkins (30, 31) surveyed the reading 
habits of 4, 493 Texas teen-agers and their image of 
the press. While the composition of the sample is 
not described, it was the authors’ opinion that “(һе 
size of the sample and the care taken in selecting 
that sample make it representative of teen-agers in 
most parts of the nation." Representative reading 
habits were: 1) 6395 of teen-agers spend 15 minutes 
or longer reading the daily paper; 2) comics and the 
front page are the sections most frequently read; 3) 
School news, amusements, and cartoons are types 
of news of greatest interest, and 4) teen-agers tend 
to enjoy most those columns with which they can 
identify- surveys of teen-age opinions, advice to 
teen-agers, and sports. Data are also presented to 
Show that the teen-agers' image of the press is fa- 
vorable as a source of news and of emergency ap- 
peals for schools in comparison with TV and radio, 
and that newspapers are easier to read than general 
magazines, news magazines, and textbooks. 

Edelstein and Larson (42) investigated the role of 
а free urban weekly newspaper inalarge city 
through a series of carefully conducted interviews 
and concluded that it was an important “facilitative 
agent" for the community. It was found that the 
reader's orientation to the newspaper was closely 
related to his community orientation and that the 
newspaper served much more than a com mer cial 
function, It appeared to serve a vital part in the 
Communication system of the community. Larson 
and Edelstein (90) also reported further data from 
the same study on the respective roles of 267 hus- 
band-wife pairs interviewed independently, finding 
that consensus of opinion toward the urban weekly 
led to consensus in their orientation to the commun- 
ity. Many detailed findings of these studies are not- 
€d by the authors. 

Carlson (22) confirmedthe assumption that inter- 
езі innews is related to a number of personal goals 
and the perceivedusefulness and importance of these 
80als among 47 college students. He used independ- 
ent ratings of student interests and goals, then asked 

is subjects to rate the usefulness of three news 
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articles in contributing to their goals and evaluated 
their responses by means of a Satisfaction-Useful- 
ness Index. Carter and Mitofsky (23), suspecting 
that perceived distances reported in the news may 
not be accurate, examined the effect of regional 
proximity and knowledge through visitation of 100 
cities for which the distance was estimated by 35 
students іп a Southern university. The hypothesis 
that ‘‘cities in the subjects’ own region of the United 
States (the South) would be perceived as closer than 
comparably distant cities located in Northern States’’ 
was confirmed for 24 matched pairs of equally dis- 
tant Northernand Southern cities. No similar effect 
for cities visited was found. M 

Deutschmann and Danielson (33) reported the re- 
sults of a study of the nature and speed of diffusion 
of three major news stories by telephone interviews 
the day followingthe appearance of the stories. Тһе 
first source of news media was TV, radio and the 
newspaper, inthat order. No rapid word-of-mouth 
news transmission effects werefound. The diffusion 
process appeared to be a stable yet efficient pro- 
cess, reaching over 90 percent of the respondents 
in the two days’ time. In this process, the news- 
paper appeared to play a supplemental role to other 
media as a first source. 

Measurement of Psycho-Social Effects. Tannen- 
baum and Lynch (141) describedthe development and 
validation of an empirically -derived Sensationalism 
Index (Sendex) based upon the semantic differential 
technique for measuring connotative meaning. 
Twelve polar adjective scales, representing three 
factors identified as Evaluative, Excitement, and 
Activity were shown by factor analysis to account for 
60 percent of the total variance of a larger number 
of scales. Application of scale ratings to three 
versions (sensational, bland, and mixed) of three 
stories indicated that the three versions were signif- 
icant beyond the . 01 percent level and that the sen- 
sational version was significantly different from the 
other versions at the .001 percent level. Addition- 
alresearch intothe development of similar types of 
Scales for measuring attitudes toward a newspaper 
was reported by Lyle (92), who showed that 20 se- 
lected scales were correlated to a substantial de- 
gree (,86) with the Stanford test of newspaper atti- 
tudes. 

Haskins (70) used reader behavior and satisfac- 
tion as criteriato validate Gillie's Abstraction In- 
dex (AI) in a study of selected portions of an issue 
ofthe Saturday Evening Post. He found that the 
more concrete the article as measured by the AI, 
the more likely will be the reader to finish it; but 
that the more abstract the article, the higher the 
reader satisfaction. For readers completing an 
article, arank-order correlation of ‚80 significant 
at the . 01 percent level was secured. 


6. Measurement of Press Freedom. Greenberg 
(61) used the four variables of literacy rate, per 


capita income, daily circulation per thousand, and 
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number of daily newspapers in estimating press 
freedom in 32 non-communist bloc nations in 1956- 
58. Hefoundthat these factors accountedfor 81 per- 
cent of variance in the press freedom score against 
three criteria of governmental control of access to 
news, domestic press, and foreign press. The cor- 
relation of literacy rate with the other variables 
was substantial except for number of dailies. 


The Psychology of Reading 
1. Psychological Correlates of Reading. 


Personality and Reading. Inastudy con- 
ducted in England, Lynn (93) tested the assumption 
that ‘‘those who do well in educational tasks should 
be introverted and generate reactive inhibition 
slowly’’ in 82 children, aged eight to eleven. Cor- 
relations of . 33 and -. 23, respectively, betweenthe 
Schonell Graded Reading Vocabulary Test and tests 
of attention and reminiscence were significant at the 
five percent level, thus yielding suggestive but not 
conclusive evidence of the relationshipunder inves- 
tigation. 

Porterfield and Schlichting (117) hypothesized 
a significant relationship between peer status and 
reading achievement, irrespective of area of pres- 
tige and socio-economic levelof school or commun- 
ity. A multiple factor sociometric questionnaire of 
five items (choices of classmates for social, sports, 
political, and academic activities) and the Stanford 
Reading Test were given to 981 pupils in grade six 
classes of 15 schools of high, middle, and low so- 
cio-economic status. When scores by classes were 
rank-ordered into thirds and subjected to Chi- 
square analysis, significance at the five percent 
level was obtained for the original hypothesis, for 
each area, апа for each level except for social and 
sports choices in the low socio-economic group. 

Cassel and Haddox (24) failed to find a sig- 
nificant relationship between r ead ing achievement 
and tests of tension and needs, and of social insight 
among 200 ninth graders. However, Anderson (4) 
reported that among 415 entering students at the U- 
niversity of Western Australia, anumber of factors 
on the Cattell 16 Personality Factor Questionnaire 
were significant at the . 0175 level in relation to 
Scores on the Cooperative Reading Test. Better 
readers appeared to be more intelligent, more cas- 
ual, more sensitive, more introverted, more radi- 
cal, and more self-sufficientthan poor reader s. 
Moderate multiple correlations between .3 and .5, 
and between . 4 and . 6 whenintelligence was includ- 
ed, were obtained for reading achievement and per- 
sonality scores. 

Roth (123) studied the self-concept in relation to 
the reading performance of 54 university freshmen 
ina voluntary reading improvement program in which 
the consistent use of rate-training films made con- 
tinuing demands upon the reader. Onthe assump- 
tion thatthis situation would produce variable effects 
of defensiveness accordingto degree of student suc- 


cess, several indices were used to determine the 
self-perception of students classified as Improver, 
Non-improver, and Attrition groups. Тһе author 
found that amount of general defensiveness and spe- 
cific defensiveness with respect to Self as Reader 
was greatest in the Attrition group and least in the 
Improver group. Data on several other factors are 
included in the study. 

In reportingona continuing longitudinal study, 
Durkin (38) noted a number of characteristics shown 
by 49 California children of Caucasian, Negro, and 
Oriental origin who learned to read before entrance 
to school. While the average Binet І. ©. of this 
group was 121, over one-third had I.Q.'s of less 
than 110, and the group range was 91 to 161, Dur- 
kin found that the majority of these children were 
“curious, conscientious, serious-minded, persis- 
tent, and self-reliant," that all were read toat 
home from an early age, and that older siblings 
frequently helped them learn to read. She tenta- 
tively concluded that personality factors and early 
attitude toward reading were very important in the 
early reading success of these children, and that 
current tests of intelligence are inadequate for pre- 
dicting such early reading achievement. 

Reading and Other Variables. Bloomer (15) 
identified the variables of word frequency and con- 
creteness as most closely correlated with reading 
difficulty, but found that these two variables, plus 
multi-ordinality of meaning, were most highly cor- 
related with spelling. Words in the 149 word sam- 
ple analyzed were randomly selected from the New 
Iowa Spelling Scale. The grade level at which 50% 
of the children spelled the word correctly, and read 
and chose the word correctly, were the criteria 


used. 
Williams (146) reported that scores on a Jangu- 


age scale devised bytheauthor showed a significant 
rank-order correlation with scores on the Metro- 
politan Reading Test at all but three grades in K-6, 
but no specific data arecited. Gunderson and Feldt 
(65) examined the relationship of intelligence test 
Scores and performance on the Iowa Tests of Basic 
Skills. One hundred children, closely matched up- 
on total IQ, were dividedinto four groups of 25 chil- 
dren each with four distinct patterns of language/non- 
language IQ's: extreme language, equal, moderate, 
and extreme non-language IQ patterns. In all areas 
of the Iowa Tests of Basic Skills the highly verbal 
group ranked first andthe extreme non-verbalgroup 
ranked last. Greatest differences were found in 
reading, language, and vocabulary, while smallest 
differences were found in work-study skills and arith- 
metic. Specific teacher observations reported an 
awareness of the special talents of the highly verbal 
groups but similar recognition of the special skills 
of the non-verbal groups. : 
On the Michigan State University Reading 
Comprehension Test, Stewart(139), not surprising- 
ly, found significantly positive differences on 2 tes 
of logical reasoning between selected college fresh- 


men with high logical reasoning scores and those 
with low scores in relation to reading performance, 
Оп the other hand, McConville (95) failed to find 
significant differences between laterality, reaction 
time, and a test of reading rate and comprehension 
given to 110 introductory psychology students, 
Correlations between reading achievement 
on а number of reading tests and other aspects of 
school achievement may be found in the Educational 
Records Bulletin No. 78 (44), 
2. Developmental Factors and Reading. 


Sex Differences in Reading. Gates (54) pub- 
lished the test results for 6,646 boys and 6,468 
girls in several school systems and states who had 
been given the revised Gates Reading Survey for 
grades two to eight. Mean raw scores on reading 
Speed, vocabulary, and levelof comprehension con- 
sistently favored the girls, in most instances sig- 
nificantly so. A greater proportion of boys re- 
ceived low scores, and their SCores were more 
variable than girls' except in speed of reading in 
grades two, three and six. The author believes 
that environmental rather than maturity factors are 
the more likely explanation of the superior perfor- 
mance of girls. 

Vocabulary Development. When a vocabu- 
lary scale is based upon a random sampling of the 
dictionary, rapid progress in vocabulary develop- 
ment is shown for ages six to eight and becomes 
Slower after age eight and a half, according toa 
Study by Williams (145) of 216 children in England. 
The author developed two 100-item scales which 
were given alternately to a subject as a word re- 
Cognition and a word comprehension test, together 
with the Schonell R.I. Word Recognition Test. The 
results of the scale were combined to show the de- 
velopment of vocabulary as a percentage of the dic- 
tionary sample by Schonell reading ages. Тһе au- 
thor urges wider use of such a scale in preference 
to age Scales, presumably because the former re- 
flects absolute rather than relative growth. 

Ervin and Foster (45) compared the ability of chil- 
dren in grades 1 and 6to correctly identify the phys- 
ical attributes of objects and the personal attributes 
of pictured faces. Wide ranges were found in both 
&roups in the correct attribution of the attributes— 
heavy, big, strong—to objects, but differences con- 
Sistently favored the older children, For such per- 
Sonal attributes as clean, pretty, and happy ascribed 
to faces, the results were very uneven. The authors 
Conclude that attributes which have metaphorical and 
and connotative links in adult usage may be denota- 
tively contused by children, 

Dale and others (28) identified in an interim re- 
Search report those words foundto be familiar though 
tests of word m e aning to 50 percent of each of the 
Populations of 200 or more children in grades 4, 6, 
8, 10, and 12. 

: Flavell and Stedman (37) investigated the a- 
bility of children in grades two to nine to rank-or- 
der word pairs according to eleven categories of 
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*"logico-grammatical" relationships. Illustrative 
categories used were synonyms (fast-swift), coor- 
dinates (dog-cat), contrasts (good-bad), and whole- 
parts (tree-limb). In the task of choosing which of 
two pairs were most closely related in m eaning, 
“groupagreement in rank- ordering of wor d pairs 
in terms of semantic similarity showed a more or 
less regular increase with аре.” While young 
children did not show a definite similarity of mean- 
ing hierarchy, somewhere between grades three 
and five anadult pattern begins to emerge (in 
comparison with a previous adult study). Thus the 
Study revealed an ‘‘absence of definite developmen- 
tal change, " the possible reasons for which are 
discussed at some length by the authors. The fact 
that children in the intermediate grades appear to 
be able to make complex semantic judgments simi- 
lar in pattern to adults is provocative in its impli- 
cations for the teaching of reading at that level. 
Anderson (4) found in Western Australia a signifi- 
cant increase with age in vocabulary development, 
but a decrease with age in reading speed and com- 
prehension among 278 adults aged 18-62. The data 
were obtained on the Cooperative Reading Test for 
а limited sample biased toward the clerical and 
professional classes. 

Reading Growth in the Mentally Retarded. 
According to two surveys by Groelle (62) of an ex- 
tensive population of educable mentally retarded 
children with mean 1. Q. 's of 68 in elementary, 
junior high, and senior high schools, there is an 
*'increasing discrepancy between over-all reading 
age and mental age as the pupils become older. ”’ 
On the average, reading ages were found to be 
slightly above mental ages atthe elementary school 
level, four to six months below expectancy in junior 
high, and eight to twelve months below expectancy 
insenior high school. The important implications 
of these findings for the timing and emphasis upon 
reading improvement are discussed at some length 
by the author. Groelle also notes that if a discrep- 
ancy of one year between mental and reading age is 
taken as the criterion for reading disability, only 
25% of the senior high school population are bona 
fide reading disability problems. Inanother study 
of a group of 42 institutionalized mental retardates 
with I. Q. 's ranging from 47 to 81, Jones,Grass 
and Van Why (81) secured evidence over a three- 
year period of one-half year ’s growth on the Cali- 
fornia Reading Test. Ananalysis of variance 
Showed that the over-all differences in reading a- 
chievement for this age range were not significant, 
although females were significantly better than 
males. 

Reading Interests. Wolfson (147) found по 
evidence of change from grade to grade in the pat- 
tern of reading interests expressed in response to 
an oral interest inventory given to approximately 
2,000 children in grades three to six, but sex dif- 
ferences were evident in all twelve categories sam- 
pled except one, social studies. Additional evidence 
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of wide differences in reactions of inter mediate 
grade children, when askedto read and rate selec- 
tions on an interest-aversion scale, was provided 
by Coleman and Jungeblut (27), plus some evidence 
of differences among grade levels and communities 
sampled for given selections. The technique used 
in this study provided immediate validation of inter- 
est ratings, a matter of primary concern to Haskins 
(69) in his study to validate predictions of reader in- 
terest. A novelapproach to the fiction reading in- 
terests in grades seventotwelve was made by Smith 
and Eno (135). The question asked of 510 students 
was ‘‘If you could have an author write a story-to- 
order for you, what would you have him put in it?” 
From junior high to senior highschool levels, there 
was considerable shifting in the ordering of cate- 
gories by boys, but little for the girls whose dom- 
inant preference continued to be romance, 
3. Reading and Learning Processes. 

Perception in Reading. D iack (34, p. 126) 
cites a number of observations and informal exper- 
iments with children to support his contention that 
letters have meanings, and that * real reading" 
means that the reader ‘‘knows what letters are, 
what they look like individually, what they mean in 
terms of sound and what the order inwhich they ap- 
pear теапв.” By this, Diack appears to mean that 
letters form the significant discrimination unit in 
beginning reading processes, an assumption that is 
currently under rigorous examination by several in- 
vestigators. Тһе balance ofthis book is devoted to 
an analysis of the Gestalt theory which he finds in- 
adequate to support a whole-word approach to be- 
ginning reading. 

Ghent and Bernstein used tachistoscopic 
techniques to study children's perception of form 
orientation in pre-school children and those of early 
Schoolage. Exposure time was that at which an 
age grouprecognized approximately half of the items. 
In a study of realistic form orientation in four posi- 
tions, Ghent (56) found by analysis of variance that 
age 3and4 children, but not children aged 5-7, rec- 
ognized figures in anupward orientation significant- 
ly better than in any other orientation. The author 
notes that this findingfor the younger children is at 
variance with earlier studies by Rice (1930) and 
Davidson (1934) regarding directional preference. 
Ghent (55) likewise found that scanning of geomet- 
ric forms in two orientations appeared to be ina 
downward direction, and that children interpreted 
a figure as upside down “‘when the focal portion was 
in the lower half of the card.’’ A second study of 
geometric form recognition in several orientations 
by Ghent and Bernstein (57) showed that the recog- 
nition for right-side-upfigures was significantly dif- 
ferent from that of other orientations. These stud- 
ies include implications of the findings for reading. 

Two studies involving discrimination of letters 
and words likewise appeared, Muehl (110) studied 
the effects of three types of visual discrimination 
pre-training in37kindergarten children upon learn- 


ing a vocabulary list. Matching pre-training was 
given on the samewords, different words, and geo- 
metric forms tothree groups of children prior to the 
learning task, which consisted of 12 trials by a 
paired-associationanticipation method. It was found 
by analysis of variancethatthe mean number of cor- 
rect responses over trials differed significantly at 
the . 01% level in favor of the group pre-trained on 
the same words, while, from the other pre-training 
conditions, there was no evidence of transfer, either 
positively or negatively. The experimenter observed 
from pupil reports and from an analysis of errors 
that ‘‘single letters or details of these letters ap- 
peared to bethe basis of word matching. ° Doehring 
and Rosenstein (37) examined differences inthe rec- 
ognition of tachistoscopically presented letters, tri- 
grams, and words among groups of 40 orally- 
trained deaf childrenand 40 normally-hearing chil- 
dren in two age groups averaging C A 10,3 andC A 
13.9. No significant differences were found be- 
tween the older hearing groups, but younger hearing 
children were significantly better on each task than 
their deaf counterparts. Bothdeaf groups were also 
shown to have significantly smaller vocabularies on 
the Ammons Full-Range Picture Vocabulary Test 
than the normally hearing groups. The authors feel, 
however, that the relatively strong performance of 
the older deaf group was ‘‘dependent upon an esti- 
mate of the probability of occurrence of the visual 
stimulus rather than upon the mere frequency of 
prior auditory and visual stimulation. ” 


Dyer and Harcum (39) likewise used tachis- 
toscopic techniques with a: group of 66 nursery and 
K-2 children to test the hypothesesthat there would 
be no significant differences inaccuracy of percep- 
tion to the right and left of a fixation point before 
formal training in reading, but that there would be 
fewer errors in the left visual field after training in 
reading. The first hypothesis was confirmed, but 
the second was not, for fewer errors atthe ‚02% 
level of significance were found inthe right visual 
field after training. Intelligence and reading pro- 
ficiency were not controlled in this study. 

Beck (12) found no significant differences in 
number of reversals when 44 second-grade children 
were categorized with respect to mixed and estab- 
lished laterality, and when known reversal tenden- 
cies were tested binocularly and monocularly with 
the dominant and non-dominant eye. The children 
were of average intelligence and reading achieve- 
ment and were presented with 78 symbols consisting 
of letters, numbers, and words. The author ques- 
tions the theory that binocular vision per se is the 
chief cause of reversals, preferring rather an €x- 
planation of inadequacy in the development of visual 
perception. 

By correlating test scores on a reading test 
with two of Thurstone’s tests of speed of closure 
and one of flexibility of closure, Harootunian (6 
found no significant differences for speed among 20 
students in grades seven to eight, but a highly sig- 


nificant difference for flexibility of closure. He 
speculates that flexibility of closure may be indica- 
tive of the higher mental processes involved in 
reading. 

Reading and Meaningful Association. Hinze 
(74), using protocols of interviews, investigated the 
nature and direction of free association res ponses 
of 25 college English students to two prose para- 
graphs, one affectively toned and the other not thus 
toned. Key stimulus words in each paragraph were 
chosen, then validated by judges in the department 
of English, who likewise validated the experimen- 
ter’s analysis of the protocols. In an initial inter- 
view, the subjects classified each stimulus word as 
positive or negative, while in the second interview, 
subjects defined each stimulus word and discussed 
each paragraph at some length. Тһе consistency of 
associations with the initially established direction, 
positive or negative, led tothe classification of uni- 
directional, conflicting, and neutral categories of 
associations which were examined in relation to ad- 
equacy of interpretation. Hinze found a highly sig- 
nificant interaction between the presence of conflict 
words and misinterpretation, and a significantly 
greater amount of misinterpretation in the affec- 
tively-toned paragraph, according to Chi-square an- 
alysis. The study should be consultedfor other de- 
tailed findings. 

Hartman (68) devised a simple associative 
learning situation in which print, sound, and pictor- 
ial presentations of persons’ names could be vari- 
ously combined, a series of 75 five-second on-cam- 
era pictures of men and women serving as the pic- 
torial stimuli. Over 1,000 university freshmen 
were presented with 25 items of this information 
under varied modes of presentation and then were 
Tequired to recognize these items among the total 
75 items presented randomly. Single, as well as 
dual and triple channel presentations in all possible 
Combinations were made. Among the findings of in- 
terest is the fact that in single presentation chan- 
nels, print had the highest mean, and significantly 
50, while the facilitation between audio and print 
Channels was found to be very great. 

Reading and Retention. Ausubel (8) conduct- 
ed a controlled experiment with two groups of 40 
college seniors to determine the effect of introduc- 
ing general concepts relevant toanunfamiliar tech- 
nical passage prior to its study, upon delayed re- 
tention 60 hours later. Significant differences be- 
tween ‚01%, and . 05% were found favoring the exper- 
"mental group, which was matched with the control 
8toup for sex, field of specialization, and scores 
on a pre-test, 

ur. Falconer (46) described the effects of using 
a stimulus-response, multiple-choice, ele ctro- 
Mechanical teaching device’’ for brief periods upon 
е learning and retention of a group of eight six- 
109 Seven-year-old deaf children. Learning ef- 
mad was shown to be high with virtually no drop 
“retention after two weeks. 
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Reading and Purpose. Shores compiled 
three reports upon the reading of science materials 
by two sixth-grade classes for two distinct purposes, 
including some comparisons with the reading of the 
same passages by a group of able college readers, 
The differentiated purposes were to read for main 
ideas and to read for sequence of ideas. In the first 
study (129), adults were shown to be much more ef- 
ficient and flexible than the children, while a ques- 
tionnaire to the children showed that their goal ori- 
entation was unclear and confused as they reported 
on perceptions of their own reading habits and those 
that they would attribute to an ideal reader. The 
second and third studies (128, 127) showed the corre- 
lations among the various tests used in the study, 
including the Iowa Silent Reading Test and tests de- 
vised by the author to measure the comprehension 
of science materials, In general, Shores found that 
the purpose set by the task influences speed, and 
that in science reading for a directed purpose such 
as toreadfor the sequence of ideas, there was little 
or no relation between speed and comprehension, 

Sister Herculane (72) studied the extent to which 
102 eighth-grade students actually varied their rate 
and techniques of reading and appearedto understand 
the concept of flexibility accordingto purpose. The 
pupils were found to have no verbal or functional 
understanding of the concept, although 90 percent of 
the children recognized its desirability. The im- 
portance of this skill and its development is stressed 
by the author. 

4. Hygiene of Reading. Flores (48) reported 
two experiments on conditions governingthe legibil- 
ity of selected ty pe faces, including an Air Force 
font designed to be read by machine, He found that 
under conditions of marginal illumination and short 
exposure, but with adequate comparison time, dis- 
tinguishability depended mainly upon large pattern 
differences; but that when lighting is optimum and 
judgment time was limited, identification depended 
to a great extent on the similarity of the type to that 
dealt with indaily experience. Howell and Kraft (78) 
reported three experiments designed to quantify 
psychophysically three variables involved in letter 
recognition— image size, blur, and contrast. They 
found that a) size and sharpness judgments were re- 
lated to letter height and blurredness, respectively, 
as straight line functions, but that the relationship 
for contrast was hyperbolic; b) the reduction of two 
of three dimensions led to a disruption of sharpness 
judgments, but not to those of size and contrast; 
and c) reduced sharpness increased the apparent 
size of the stimulus figure, but reduced contrast 
decreased it. The authors' contentionthat the many 
studies of legibility have not quantified the percep- 
tual continua involved appears well taken. 

Readability of Texts. Diener and Kaczkowski (35) 
found, in applying the Yoakam Readability Formula 
to three commonly used occupational hand bo ok s — 
Career Opportunities, Occupational Briefs, and Oc- 
cupational Outlook Handbook—that each was reason- 
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ably difficult, particularly the latter. Similarly, in 
a sampling of 24 small and moder ately-sized high 
Schools in Indiana, Bentley and Galloway (13) found 
that the ten most frequently used agricultural ref- 
erence books in high schools were too difficult in 
relation to the reading ability of the students using 
them in grades nine to twelve, according to the Dale 
-Chall formula. 

In a study of the readability of the ten most fre- 
quently used college general biology textbooks in 
1953-54, Major and Collette (103)used the Flesch 
Readability Formula for analyses of vocabulary load 
and comprehension level. Theyfound that the aver- 
age readability level of these books was at least one 
grade level above that of an above-average college 
reader, and at least two grades abovethat of an av- 
erage college reader. A highsyllableindex, acom- 
mon feature of scientific writing, was found to be 
the chief factor contributing tothe reading difficulty 
of the materials. After reviewing other aspects of 
readability, the authors make five specific recom- 
mendations to improve the readability and/or use of 
these texts. Pace (114) likewise used the Flesch 
readability formulas to show that a sampling of five 
current books of readings in psychology indicates 
that they tend to be difficult to read and low on 
Flesch’s human interest factor. 

Headline Readability. Hvistendahl (79) tested 
headline readability in context to secure a more 
normal reading situation. Three-line headlines 
were prepared in capital and lower-case letters and 
in lower-case letters to compare the relative read- 
ing speeds of 80 college subjects for 50 headlines in 
both type formats. No significant differences were 
found for readability as measured under these con- 


ditions. 
Devereux Readability Index. A readability index 


based upon average sentence lengthand average 
number of characters per wordisdescribed by 
Smith (132). Validation of the use of the number of 
characters in a word as an index of word difficulty 
is offered by the writer who believes that this pro- 
cedure is simpler, more efficient, and more accu- 
rate than other alternatives. The index was vali- 
dated upon the Guiler and Coleman Reading for 
Meaning series and upon materials inthe California 
Achievement Tests for grade levels two to twelve. 
Other data upon the reliability ofthe index, its com- 
putation, and the conversion of the index into grade 
level equivalents are contained in the article. 
Other Aspects of Readability. Kearl and Powers 
(83) examined differences in the ability of three 
groups of workers (soil research specialists, soils 
extension workers, and agricultural journalists) to 
predict the difficulty of a test for farmers on under- 
standing of soiltechnology. Test difficulty was val- 
idated by the responses of 102 Wisconsin farmers. 
Agricultural specialists tended to slightly underes- 
timate the difficulty of the test, those who worked 
most directly with farmers in the field, the exten- 
sion workers, most closely approximating the diffi- 


culty level indicated by the responses of farmers, 

MacGinities (97) reported the effects of omission 
of words іп the contexts of two short passages of 
literary prose upon the ability of 600 college students 
to restore the omitted words from context. Omit- 
ting every thirdword was found to make restoration 
difficult but the larger intervals of every sixth word 
and less frequently produced no significant differ- 
ences in restoration difficulty. Words omitted in 
pairs and in groups offour were progressively more 
difficult to restore. The author noted that the influ- 
ence of context upon word choice in English prose 
seems to decrease rapidly with the distance of the 
context, andthat context more than about five words 
distant has relatively little effect upon restoration 
difficulty. Since the selections used were a short 
story by Poe and a fairy tale, the applicability of 
these findings to other types of context remains to 
be determined. 


The Physiology of Reading 


Reading and Neurological Defects. The transla- 
tion of a 1955 treatise on congenital word-blindness 
by Hermann (73), a Danish neurologist, appeared in 
1959. The author states: “І һауе takenas a starting 
-point the findings of clinical neurology and the re- 
sults of genetic investigations, andthe conclusion 
is reached that congenital word-blindness must be 
presumed to depend on a disturbance of directional 
function. It is highly probablethat the specific fac- 
tor must be sought in heredity, whereby a predis- 
position to the underdevelopment of this function is 
transmitted in certain families by means of a dom- 
inant inheritance," (p. 179). In reviewing relevant 
clinical cases, the author uses а comparison of the 
symptoms of word-blind individuals to those pos- 
sessing Gerstmann's syndrome, “а peculiar dis- 
turbance of body image. ..due to disease of a cir- 
cumscribed area in the parietal lobe” (p. 110), 28 
primary evidence to support the medical contention 
that the cause of word-blindness is specific and 
pathological. In this lucid and tightly-knit book, the 
medical and psychological viewpoints toward one 
form of severe reading disability are carefully com- 
pared for the consideration of those who work with 
severe reading disabilities. 

In America, Goldberg, Marshall, and Sims (59) 
used electroencephalographic techniques (EEG) in 2 
controlled investigation of possible sub-clinical 
brain damage of 25 reading-clinic cases, of normal 
intelligence and no ocular abnor malities, put re- 
tarded two or more years in reading ability. The 
control group was 25 randomly selected normal 
readers. Abnormal EEG patterns were found in 2 
of the 25 experimental cases whereas only two such 
abnormal patterns were found in the control group, 
a finding which led the investigators to conclude thal 
sub-clinical brain damage in cases of congenital dys- 
lexia may existandbe identified. Cohn (26) reper 
eda two-year clinical study of 46 children W! 


HARRIS 


reading and writing difficulties in c o m parison with 
a control group of 130 randomly selected children 
from normal classrooms. Through the use of a 
comprehensive battery of tests, including EEG's, 
Cohn concluded that **delayed development in the use 
of graphic symbolsfor communication purposes was 
primarily an ex pr e ssion of general disturbance in 
neurological function. ’” 

Silent Speech and Silent Reading. The cooper- 
ation of educational and medical specialists is evi- 
dent in the recent translation of Edfeldt’s (43) inves- 
tigation of this topic. The author describes in detail 
the steps taken to validate, through the use of the 
EEG, the presence of silent speech by recording 
changes in EEG patterns in the mylohyoid muscle at 
the base of the tongue. He concludes that such ac- 
tivity does represent silent speech. To test the 
presence or absence of silent speech, 84 entering 
Students at the University of Stockholm, averaging 
22 years of age andrepresenting a normal distribu- 
tion of reading ability, were randomly assigned to 
four experimental task treatments in reading Swed- 
ish texts: а) easy and clear text, b) easy and blurred 
text, c) difficult and clear text, and d) difficult and 
blurred text, Students were divided into good, me- 
dium, and poor readers, thus creating an exper- 
imental design with seven replicates for 12 experi- 
mental conditions which were analyzed by analysis 
of variance, Edfeldtfound that good readers engage 
in significantly less silent speech than do poor read- 
ers, that easy text materials are read with signifi- 
cantly less silent speech thandifficult ones, and that 
the reading of clear text tends to produce less silent 
Speechthan does the reading of unclear text. Low 
reading scores were accom panied by high silent 
8peech scores, but even good readers showed in- 
creases in the amount of silent speech activity when 
confronted with difficult text. Edfeldt concludes 
that “It is then impossible to view silent speech as 
а habit detrimental to silent reading. While the 
8enerality of the latter conclusion may be open to 
Question, the author has made a valuable contribu- 
tion to the methodology of study ing a complex and 
difficult problem. 


Serum Correlates of Reading Disability. In a 
brief note on eight older adolescent males with se- 
vere reading retardation, Smith(133) reported asso- 
ciated test results found by biochemical analysis of 
Serum to Supplement previously reported findings. 


Visual Accommodation and Reading. Eames (40) 
oun un urban and suburban children aged five 
to eight that there were no cases of accommodation 
inadequacy among five-year-olds nor among subur- 
ban children at any age tested. Accommodation 
pavers for early reading tasks in schools were 

Ound to be greater at these age levels than would be 
*Xpected later, and were thus presumed to be ample 
sofar as this visual function is concerned. 
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The Teaching of Reading 


1. Status of Reading Instruction. In developing 
new norms for four batteries of the Gates Reading 
Tests, Gates (52) compared the mean performance 
of approximately 107, 000 children in 1937 with that 
of about 31,000 children in 1957, with respect to 
reading grade and age status. Equivalent scores 
on the new and old tests were determined by giv- 
ing both editions of the tests to about 1, 630 pu- 
pils in 1957. The author foundthat grade level 
comparisons slightly favored the 1937 children, but 
that when children of comparable ages were com- 
pared in the grade range of four to 6.5, the 1957 
pupils were on the average about 5.3 months ad- 
vanced in reading achievement over the 1937 pupils. 
Greatest differences on the Basic Test of Reading 
to Understand Precise Directions were found for all 
children, but particularly for fifth-grade children. 

Baughman (11) analyzed responses to 133 
questionnaires from Illinois junior high schools 
concerning special reading instruction, Selected 
findings were that over half of the schools had such 
programs, with major responsibility for them as- 
sumed by the English Department. Slightly over 
one-third used specially trained teachers, and pro- 
grams were concentrated largely in grades seven 
and eight. About 25% of the schools reported devel- 
opmental programs which included all or nearly all 
students. The most important functions stressed 
were those of developing meaning, recognizing pu- 
pil needs and interests, and providing for good study 
habits. 

Mersand (107) in a study designed to explore at- 
titudes toward English teaching from such widely 
varying sourcesas business, government, libraries, 
the press and the legal profession as well as from 
educators, reported a wide spectrum of observa- 
tions relating to reading. Generally favorable re- 
ports of reading tastes and habits were secured 
from librarians. Other respondents in other areas 
stressed the need for greater emphasis upon reading 
skills, both basic and advanced, while the im por- 
tance of reading was consistently recognized by re- 
spondents in many fields. 

Acker (2) surveyed the status and trends in 
177 adult reading improvement programs in mili- 
tary, business, and government agencies. Repre- 
sentative trends included attention to more mature 
aspects of reading, longer courses, and the devel- 
opment of new materials and testing devices. Less 
time was devoted to the use of mechanical devices 
although the quantity of such devices was not found 
to be appreciably reduced. 

2. The Teacher and Reading. Austin et al. , (7) 
investigated the status of college and university pro- 
grams for the preparationof teachers of reading by 
means of 530 questionnaires received from 371 insti - 
tutions andfield interviews at 74 r andomly selected 
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institutions representing major geographical re- 
gions of the United States. While the findings and 
the 22 recommendations of this report are too nu- 
merous for inclusion here, it should be noted that 
the report includes data on broad curricular and ad- 
ministrative provisions, and upon faculty back- 
ground and opinions, as well as uponthe conduct 
and content of reading courses. Itis suggested that 
the reader, in evaluating the report, carefully ex- 
amine the questionnaire items and related data giv- 
en in the appendix. 

Strang (140) analy zed responses to а question- 
naire from 77 colleges and universities concerning 
courses on the teaching of reading in high schools 
and colleges, and concluded that many more such 
courses shouldbe available for adequate preparation 
of teachers at these levels. 

McGinnis et al., (98) reported the results of 
an inyentory completed by 549 elementary teachers 
and 54 administrators in Michigan concerning prob- 
lems in the teaching of reading and provisions for 
in-service training. Teachers' responses were 
significantly greater in number with respect to 
problems in teaching basic skills, especially phon- 
ics, in identifying causal factors in poor readers, 
and іл not having ап in-service program. Оп ће 
other hand, administrators feltthatteacher prepar- 
ation and adequacy, and the need for teachers' pro- 
fessional meetings, clinics, and demonstrations 
were to a significant degree im portant problems. 
Cappa and Schubert (21), in a sample of 83 inter- 
mediate teachers, discoveredthatonly athird of the 
group felt that they were making inadequate provi- 
sions for bright children with I.Q.'s over 130, and 
that most of the teachers appeared quite comfortable 
in working with these children. 

Haag, Sayles, and Smith (66) summarized the 
requirements for certificationof reading specialists 
in 46 states, finding that only twelye states have 
Such requirements, which tendto vary considerably, 
and that the trend toward such certification is rela- 
tively recent. 


Grouping. Skapski(130) made a series of 
comparisons of the reading achievement of grades 
two and three pupils in an ungraded primary pro- 
gram with those in arithmetic and reading in a grad- 
ed primary program in the same school system. In 
all comparisons, including class, school, and abil- 
ity levels, which averaged 116 1.0. for all second 
and third grades, the ungraded primary pupilsa- 
chieved significantly better and closer to expectan- 
cy than graded pupils. Ingram (80) reported sim ~ 
lar favorable results for the ungraded primary in a 
Michigan city for a group of 68 th ird-grade pupils 
butcites no controls upon the data. 

Rothrock(124) compared heterogeneous, ho- 
mogeneous, and individualized forms of grouping 

for 84 pupils in grade 4 and 102 in grade 5, equally 
divided amongthe three groups, ina controlled one- 


year experiment. The average І. ©. was111, and 
the groups were testedupon the Iowa Basic Tests of 
Reading Comprehension and Work-Study Skills. By 
analysis of covariance, no significant differences 
were found among the groups in reading gain in fifth- 
grade comprehension, but the reading achievement 
of the homogeneous group was significantly higher 
at the .01% level of confidence for the other compar- 
isons made. The author notes that teacher motiva- 
tion and competence were uncontrolled variables and 
that results on a reading attitude inventory favored 
the individualized instruction group. Smith and 
Becker (134), on the other hand, report very favor- 
able progress in an uncontrolled study of a ‘‘self- 
selection" reading program of a year and a half in 
grades 5 and 6 onthebasis of several types of eval- 
uation. 

Vocabulary Analysis of Primary Texts. Groff 
(63) tabulated the number of new words at sueces- 
sive levels in twelve primary basal readers, partic- 
ularly for teacher use inindividualized reading pro- 
grams. Kolson and Fullmer (85) c o m piled a basic 
word list of 184 words, including nouns, according 
to frequency in eleven primary basal readers and 
compared these to the Dolch and International Kin- 
dergarten Union lists. Astudy by Kerfoot (84) gives 
a first- and second-grade word list drawn from six 
arithmetic texts and indicates their correspondence 
to the Gates and Dale primary vocabulary lists. 

Diack (34: Appendix) made an analysis of the vo- 
cabularies of eight primers used in English schools, 
including some adaptations of American primers, 
to show the differences in w hole - word vocabulary 
control versus ‘‘letter-meaning’”’ control. The an- 
alysis of phonically controlled primers shows many 
different words, few different letter mean ings, 
many phonically simple words, few irregular words, 
and a low ratio of letter-meanings to words. The 
reverse is true for whole-word approach primers. 

Comparative Methods: Conventional Mater- 
ials. МасКіппоп (101) published a controlled inves- 
tigation “into the lear ning process as exemplified 
by young children learning to read.’’ The experi- 
mental materials consisted of those developed by C. 
M. Gibson and I. A. Richards which were employed 
in comparison with a conventional у h ol e-word ap- 
proach among small but carefully matched groups О 
Scottish five-year-olds. The experimental group 0 
twelve children read the material aloud while elab- 
orate protocols were taken for ten meetings. One 
control group read the same material independently, 
while another read conventional mater ials al 
Significant differences favoring the expe rimenta 
group were consistently found in reading progress 
in two studies, one with beginning readers an 4 
other with children already reading. Detailed rec 
ords were made of the children’s reading атг 
drawing and printing activities, use of speech, ie 
social growth, again to the advantage of the expe | 
mental subjects. Тһе author makes а compres 
sive examination of the linguistic and psycholog 


theory underlying the use of the experimental ma- 
terials and stresses the needfor further experimen- 
tation. He notes that “What the present research 
appears to do primarily is to act as a reminder of 
the fact that how young children do learn to read is 
still little understood. ” (p. 238) 

Bloomer (16) reported, in what appears to be 
a more complete account of a previous study, acon- 
trolled experiment with four exper imental and six 
control classes in beginning reading. Results оп the 
Gates Primary Reading Test favoredthe experimen- 
tal groupusing materials controlled according to a 
“Progressive Choice Method’’ in which the letter 
was the basic learning unit. When readiness was 
held constant, the experimental group was superior 
to the control group except in paragraph reading. 
The author notes that at the end of the 17-week ex- 
perimental period, the experimental group read the 
basal reader used by the control group very inde- 
pendently and in half the time spent by the control 
group. 

Gates (53) analyzed the progress of 267 child- 
ren taught for a varying number of years by an iso- 
lated phonics approach infour New York schools, 
He reported that the accomplishment quotients of 
these children were below 100, that the number of 
years spent in this approach showed no relation to 
reading ability, and that intelligence was not a sig- 
nificant factor in their reading progress. 

Morgan and Stucker (109) matched 90 pairs of 
high achievers and low achievers in reading to se- 
cure four experimental and four control groups of 
fifth- and sixth-grade children for experimentation 
with the Joplin plan of teaching reading. Matching 
was on the basis of reading scores on the Durrell- 
Sullivan and California Reading Tests. Atthe end 
of the academic year, the meanscores inreading of 
both the high and low achieving experimental groups 
Were “significantly higher at well less than the . 10 
level." Certain limitations of the study were rec- 
ognized and explored. The authors concluded that 
the experimental results may reflect greater moti- 
Vational effectiveness associated with the lowered 
variance of the groups. 

Comparative Methods: Films-TV. Іп ап un- 
controlled but extensive study of the New Castle text- 
films in a Canadian city, McRobbi e(102) used four 
combinations of basal and supplementary materials 
in two of which the textfilms were used basally among 
ten first-grade classes. Mean test scores among 
the treatment groups were conflicting and led to no 
“conclusive recommendations as to the merits of any 
Опе of the four reading situations. ’? Extensive sup- 
plementary data are also reported. Ina similar 
Comparison involving groups matched for age, Sex, 
intelligence, and socio-economic status, Bradley 
(19) followed two groups of 21 pupils for a three-year 
Period. At the end of the experiment with textfilms 
іп grade 1, differences favored the experimental 
&roup on the Gates Primary Reading Test, were less 
Svident in grade 2, and resulted in no significant 
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differences between the groups at theend of grade 3 
on the Philadelphia Reading Test. 

Armstrong (6) found no significant differences 
in a two-week experiment in teaching work-study 
skills by television and by conventional methods 
among three groups each from grades 5 and 6. 

Reading Skills. Hislop (76) studied the effects 
of four months of intensive training on social stud- 
ies problem-solving techniques and its transfer to 
the reading of socialstudies problems without guid- 
ing questions in groups of matched high achievers 
апа low achievers. STEP tests for social studies 
reading were given before and afterthe experiment, 
and protocols of verbalized responses were record- 
ed. Significant experimental-control differences 
were found for the experimental high achievers only 
on the STEP test and on verbalized responses, al- 
though both high and low achievers in the experi- 
mental group made highly significant gains on the 
transferproblem. Eckholz and Barbe (41) described 
а self-teaching and testing vocabulary device 
which producedsignificantly more new words learned 
in an eight-week period among experimental 
seventh-grade subjects than among the controls. 

Reading Tastes. Three library projects for 
improving the reading tastes of superior pupils in 
grades six to eight were described by Grogan (64). 
The opportunity to read and passtests upon the con- 
tents of lists of Newberry Award-winning books and 
upon selected lists of classics was shown to be sat- 
isfying to able students. 

Reading in Content Fields. Forseth (49) noted 
in examining reading pre- and post-test scores of 
tenth grade students in three Minnesota schools that 
students enrolled in geometry courses showed gains 
significantly greater in reading comprehension and 
in rate-comprehension efficiency than did students 
enrolled in biology, home economics and industrial 
arts (but not enrolled in geometry). Students were 
matched for initial reading scores, intelligence, and 
School. The author recognizes certain limitations 
in the study in a discussion of the results and their 
implications. 


Speech Training and Reading. Sommers and oth- 
ers (137) conducted an extensive study involving 
1562 first-grade children to investigate the effects 
of speech therapy and speech im provement pro- 
grams upon articulation and reading. Among the 
test results it was found that a) speech therapy did 
not significantly affect basic reading aptitudes and 
skills, b) speech improvement did significantly im- 
prove such skills in children with normal articula- 
tion and those with misarticulation, and c) a combi- 
nation of speech therapy and speech improvement 
for children with misarticulations produced signifi- 
cant differences in reading comprehension in com- 
parison to those children receiving speech improve- 
ment only. 

Audio-Visual Materials and Reading. Minsel 
(108) investigated the effects of abundant use of au- 
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dio-visual materials vs. complete absence of such 
materials upon reading achievement and habits in 
two fifth-grade classes equatedfor reading achieve- 
ment and intelligence. Significant results favoring 
the experimental group were found on each of the 
Gates Basic Reading Tests, although the control 
group exceeded the experimental group in the num- 
ber of books read. The experimental group read 
well in the time it had for reading while working 
with some 361 audio-visual aids of alltypes during 
the school year. 

Promotion Policy and Reading. Kowitz and Arm- 
strong (87) used evidence provided by longitudinal 
channels of development data to study the effects of 
promotion on reading and arithmetic achievement in 
two schools having different policies on retention of 
pupils. They found that the school with a high fail- 
ure rate had a significantly higher proportion of pu- 
pils who showed a gain of two channels in reading 
and a smaller proportion who did not change chan- 
nels. This effect was shown to be true largely for 
children in no real danger of being retained, while 
the high failure rate policy did little to help pupils 
whose reading achievement showedaloss of a chan- 
nel. 


4. Reading Improvement and Remediation. 


Special Reading Programs. Clark and Karp 
(25) reported that the average gains of 104 different 
elementary children in five years of experience with 
three-week summer reading programs were sub- 
stantial, and were considerably greater than gains 
of comparable children who had been worked with 
individually during the regular school year. The 
author attributes the difference to the change inen- 
vironment and to more complete individualization of 
work Legere and Tracey (91) and Poulton (119 
found gains in rate and comprehension in an army 
reading improvement program and in adult reading 
programs in England in 1952-59, respectively. 
Poulton reported a lower percentof gain in compre- 
hension than in reading rate among themany pro- 
grams surveyed, and discusses its implications. 

Favorable results of special reading pro- 
grams were secured for college-bound high-school 
students by Dobrin (36) and Noall (112). The latter 
study, which reports an experiment in‘‘massed dif- 
ferentiated skills instruction'' designed for self-di- 
rection, contains an extensive bibliography of skill- 
building materials for high-schooluseand describes 
the use of an intermittent timer as a pace-setting 
device. 

Pallone (115) secured significant gains from in- 
tensive reading improvement programs for both 
six-week and six-month duration uponthe verbal 
part of the Scholastic Aptitude Test incollege -bound 
senior boys in an independent school. The author 
recommended a combinationof both short- and long- 
term courses for maximum value in relation to such 
а task since such a test appears to be heavily 
weighted with factors susceptibleto improvement in 
developmental reading courses. 


Among special programs for able students, 
McCracken (96) noted substantial progress in rate 
of comprehension among a group of eight bright 
sixth-grade girls when emphasis was placed upona 
“mature approach” to reading. He found that the 
students were somewhat reluctant to accept respon- 
sibility fully and independently. Substantial gains 
were likewise reported at the tenth-grade level for 
an intensive six-weeks remedial reading program for 
experimental over control subjects in high honors 
English classes by Gowan and Scheibel (60). 

Comparative Special Reading Programs. A 
second portion of the study by Noall (112) revealed no 
significant differences between individualiz ed and 
teacher-directed group approaches ina summer- 
reading clinic program for students who varied wide- 
ly ingrade placement from grade seventhrough13. In 
a controlled study of 86 seventh-grade students, 
Walker (143) found that highly individualized in- 
struction in two groups, one using the SRA Reading 
Laboratory, was superior at the . 05% level of con- 
fidence to a non-individualized approach, Atthe 
college level, Spache,.Standlee, and Neville (138), 
using covariance analysis, secur ed no significant 
differences in reading achievement among students 
in six-weeks remedial programs in which the focus 
was upon individualized study, audio-visual presen- 
tation, and workbooks, respectively. Differ ences 
in attitude toward reading tended to favor the indi- 
vidualized instruction group. 

Martens (105) investigated the role of the pacer 
in improving reading comprehension in controlled 
groups of junior high school subjects. Analysis of 
variance and covariance showed no significant dif- 
ferences between experimental and c ont rol groups 
of above-average ability although both groups im- 
proved about two grades in one year intotal reading 
growth. The experiment suggests that other tech- 
niques than pacing reading may improve comprehen- 
sion. 


According to critical ratios, in a study con- 
trolling sex, intelligence, normality of vision, and 
initial reading achievement atthe sixth-grade level, 
Bormuth and Aker (17) found no significant differen- 
ces as a result of tachistoscopic training in reading 
achievement. Bottomly (18) reported that, at the 
end of a three-week period, the use of the Controll- 
ed Reader in grades five and eight produced, ingen- 
eral, no significant differences between experimen- 
tal pupils and those in conventional developm e ntal 
reading classes. However, five months later, he re- 
ported differences favoring the experimental group 
at the . 05% level upon all but one score on the Chi- 
cago and the Cooperative reading tests. The author 
suggests that this is a «late blooming" effect. Daly 
and Lee (29) reported no significant differences in 
homogeneous versus heterogeneous approac hes 
while working with 77 mentally retarded readers 
who showed a discrepancy between reading rate an 
mental ages attributed to WISC scores. 

In a comparison of low language achieve 


rsin five 


grade X Canadian schools, Larson, Black and 
Browne (89) found no significant differences between 
four homogeneous experimental groups and four het- 
erogeneous control groups in reading achievement, 
A specially designed curriculum was used inthe ex- 
perimental group and both groups made significant 
progress on reading and language tests selected 
from the Iowa Tests of Educational Development. 
However, the effects betweengroups were apparent- 
ly obscured by uncontrolled initial differences in the 
population of the study. 


Remediation of Severe Reading Disability. 
Delacato (32) described case studies of the treat- 
.ment and progress of severereading disability in 
children, based upon a theory of inco m plete total 
neurological development in establishing lateral 
dominance. Aspects of treatment reported effective 
include the control of posture in sleep, the avoid- 
ance of the use of the non-dominant limbs in as 
many activities as possible, andthe suppression of 
tonalactivities in reading. The study should be 
consulted for details of theontogenetic principle un- 
derlying the treatment. 

* Roman (122) reported a study of the compara- 
tive effectiveness of three group approaches—rem- 
edial reading, tutorial therapy, and interview ther- 
apy—upon improvement in oral reading and adjust- 
ment for 21 delinquent boys. Tutorial group thera- 
By produced the greatest gains in oral reading on 
the Gray Oral Paragraphs, although differences 
among the groups were not substantially significant. 
However, group tutorial t herapy produced signifi- 
cantly greater gains upon an elaborate series of psy- 
chological measures, and upon school adjustment 
according to the ratings made by social workers. 
For such a group of subjects, the author concludes 
that remedial reading is **not enough’’ and that psy- 
chotherapy **offers too little." Roman also discus- 
Ses the differences in the therapeutic processes in- 
volved in each approach. 

; Bibliotherapy. Theuse of protocols of inter- 
views before and after reading stories to a total of 
80 children to release their fears of the dark and of 
dogs led Webster (144) to conclude that bibliotherapy 
was helpful in subst antially reducing the fears of 
these children. 

5. Reading Diagnosis and Testing. 


Perceptual Factors. Koppitz, Mardis, and 
Stephens (86), seeking to identify first-grade chil- 
dren with potential reading and learning difficulties, 
found that the Bender Gestalt predicted average 
achievement in reading on the Metropolitan Reading 
Test as well as the Metropolitanor Lee-Clark read- 
ing readiness tests. The Lee-Clarktended to over- 
estimate achievement of the 272 subjects, while the 
Bender Gestalt and the Metropolitan Readiness Test 
tended to underestimate the achievement. 

NUS Lachmann (88) examinedthe following hypo- 
esis: “If reading disability reflects a lag or re- 
tardation in perceptual-motordevelopment, retard- 
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ed readers should show significantly less mature 

performance on the Bender Gestalt test than either 

a comparable group of normalreaders or a group 
of emotionally disturbed children who have normal 
reading ability. ’’ He studiedthree carefully equated 
groups of 40 pupils each, sub-divided into younger 
and older pupils averaging about eight and eleven: 
years. A multivariate information transmission 
analysis showed no interaction between age, group, 
or types of distortion on the Bender Bestalt, but 
younger groups showed significantly more distortion. 
The number of distortions among the retarded 
readers was significantly greater than in normal 
readers, but less so in comparison with the emo- 

tionally disturbed children. Frostig, Lefever, and 
Whittlesey (50) noted that five aspects of visual per- 

ception may be discrete, according to clinical find- 

ings: eye-movement coordination, figure- ground 
relationships, form constancy, spatial position, and 
spatialrelations. They describe the development 

and standardization of a Developmental Test of Vis- 

ual Perception. 

Goetzinger, Dirks, and Baer (58) conducted a con- 
ducted a carefully controlled study of 15 poor male 
readers and 15 good male readers closely matched оп 
СА, IQ, and socio-economic groupto investigate aud- 
itory discrimination and visual perception differ- 
ences. Agesrangedfrom 10-7to12-9. The Califor- 
nia Reading Test was used to appraise reading 
achievement. Specificfindings were: a) good readers 
were significantly superior to poor readers at the .01 
level on the Rush Hughes, Wepman, and the differ- 
ence score (between the W-22 andthe Rush Hughes) in 
auditory discrimination; b) “Тһе statistically sig- 
nificant difference between groups on the difference 
score (. 01 level) plus the lack of correlation be- 
tween the difference score and intelligence possibly 
is suggestive of a primary auditory-cortical dys- 
function in poor readers;'' and c) differences on the 
Raven's Progressive Matrices significantly favored 
the good readers at the . 01 level. Differences at 
this level were not found for the Figure- Ground 
Test nor the Gottschaldt Figures of visual percep- 
tion. 

Neurological Factors. Delcato (32) identified 
a constellation of seven factors, mostly relating to 
laterality, each of which were found in 40 or more 
of a group of 45 severely disabled young male read- 
ers. He describes these as universal factors inthe 
neurological realm and contrasts these to other 
commonly identified factors. 

Intellectual Factors. Parsley and Powell (119) 
noted that for 169 entering first-grade children whose 
reading readiness test scores indicated a delay of 
from three months to a year in reading instruction, 
the correlations between the Lee-Clark Reading 
Readiness Test and Binet 1075 were low, whether 
based on raw scores or grade placement. Inasec- 
ond study, Powell and Parsley (119) found that in 
testing 703 children, first-grade Lee Clark Reading 
Readiness Test scores were primarily useful in pre- 
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dicting total California Reading Test scores for the 
group atthe beginning of second grade. The authors 
question its usefulness in predicting individual per- 
formance, especially since low groupplacement ap- 
peared to be relatively unrelatedto reading achieve- 
ment atthe beginning of second grade. Turner 
(142) found that intelligence and motivation were im- 
portant in determining the level at which junior high- 
school pupils benefit from remedial reading. Ra- 
venette (120) described the use of the Crichton Vocab- 
ulary Scale in an attempt to find a better means for 
diagnosing specific intellectual ability related to 
reading among 92 young children in two English 
schools. No significant differences between mean 
reading and vocabulary scores were found. The 
author includes predicted reading scores derived 
from a regression equation and discrepancy tables 
for use with the vocabulary scale. North (113) re- 
ported correlations between a new non-verbal test 
of intelligence, the SRA Tests of General Ability 
(TOGA), with various reading tests at grades 2,5,7, 
9, and 12. Themedian correlation of . 57 was 
thought high enough to be of possible value ‘‘as опе 
basis for judging whether a pu pil's development in 
reading skills is approximately in line with his learn- 
ing ability. '' 

Three studies dealt with WISC sub-test pro- 
files and reading disability. Neville (111) found that 
among 35 pairsof male retarded and normal readers 
matched by total IQ and grade level, the retarded 
readers were significantly low at the 01% level of 
confidence in information, arithmetic, non- verbal 
IQ, and, in 22 cases, in digit span. Contrariwise, 
they were similarly high in picture arrangement, 
block design, and verbal IQ. The author relates his 
findings to the psychological functions presumed in 
the test patterns. Kallos (82) did notfind much dis- 
crepancy between his findings on the WISC profile 
for 3" boys and those of the earlier study by Altus, 
except for a high score in block design. Hirst (75) 
described for 30 poor readers the application of a 
two-way analysis of WISC subtest scores, involving 
deviations from the age mean for the subject as well 
as his score scatter. Differences in profiles for 
mildly disabled and severely disabled readers were 
found. 

Teacher Judgment. Lytton (94) discovered 
no significant differences between test-result and 
teacher-judgment bases for selecting 97 students for 
remedial reading in ‘‘adjustment centers’’ in Eng- 
lish schools. 

Family Adjustment. Applicationofthe Leary 
Interpersonal Check Listby Seigler and Gynther (126) 
to determine the relative family harmony among ten 
parents of poor readers and ten parents of good 
readers showed that high disidentification scores a- 
mong the former gave evidence of unhar monious 
family relationships. The parents of poor readers 
were found to be significantly more aggressive, 
distrustful, self-effacing, and de pendent in their 
descriptions of self, spouse, and child. 


Cultural Effects. Mazurkiewicz (106), in seek- 
ingto investigate the assumption that males may 
consider reading asa “шов у feminine” rather than 
a *mostly masculine" activity, reported somewhat 
tenuous evidence that 157 eleventh-grade boys and 
their fathers tended to view reading in the former 
vein as did the poorer readers. 

Reading and Spelling Ability. Aaron (1) com- 
pared the upper and lower 27% of fourth and eighth 
graders on the California Reading Test with respect 
to their ability to spell, to spell phonic syllables, 
and to syllabicate. Good readers achieved higher 
mean scores on all measures, but numerous indi- 
vidual differences in performance were foundin 
both good and poor readers. 


Concluding Statement à 


Research related to reading inthe previous year 
shows several promising trends. A number of the 
investigations revealed great care in experimental 
design to control significant variables. Thorough 
and scholarly analyses of related research accom- 
panied others. A willingness to examine intensively 
the significant learning processes in reading was 
likewise evident, as was cooperative endeavor a- 
mong workers representing several disciplines of 
study. As such efforts are continued, they should 
advance the understanding of reading processes, ef- 
fects, and conditions, tothe more effective utili- 
zation of reading in daily life. 4 
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XII (January 1 , 42-44. 

Questionnaire survey of 83 intermediate teachers 
in Southern California concerning their competence іп 
meeting the reading needs of pupils with IQ's over 130, 
22. Carlson, Earl R. ‘‘Psychological Satisfaction 

and Interest in News, "' Journalism Quarterly, 
XXXVII (Autumn 1960), -bi. 

Study of relationships bet we en interest ratings 
on 20 news topics and 25 perceived goal-values re- 
sponded to by 47 college students, as shown by a 
Satisfaction-Usefulness Index. 

23. Carter, Roy E., Jr., and Mitofsky, W. J. 
** Actual and PerceivedDistances in the News;' 


Journalism Quarterly, XXXVIII (Spring 1961), 
223-5. 


Comparison of actual and perceived distances 
judged by 35 students in ageography class in а South- 
ern university as afunctionof regional (North-South) 
proximity and knowledge through visitation. 

24. Cassel, Russell N., and Haddox, С. “Сотраг- 
ing Reading Competency With Personality and 
Social Insight Test 5согев,”” California Jour- 
nalof Educational Research, XII (January 
1961), 27-30. 

Correlations between measures of personality 
tension and needs, socialinsight, academic achieve- 
ment, and reading in the social studies area for 200 
ninth-grade students in Phoenix, Arizona, high 
schools. 

25 Clark, M. Phipps, and Karp, J. “А Summer 
Remedial Program,’’ Elementary School Jour- 


nal, LXI (December 1960), 137-42. 

Progress report on reading improvement in 104 
New York City children who participated in summer 
remedial reading programs of three weeks' duration. 
26. Cohn, В. ‘‘Delayed Acquisitionof Reading and 

Writing Abilities in Children—A Neurological 
Study,” Archives of Neurology, IV (February 
1961), 153-64. 

Two-year clinical study of 46 children with spe- 
cific reading and writing difficulties who were given 
detailed perceptual and neurological tests to ascer- 
tain if delayed development in the use of graphic 
symbols for communication purposes is primarily 
an expression of general disturbance in neurological 
function. 

27. Coleman, J. H., and Jungeblut, Ann. ‘‘Chil- 
dren's Likes and Dislikes About What They 
Read,” Journal of Educational Research, LIV 
(February 1961), 221-8. 

Survey of reading likes and dislikes of 750 chil- 
dren in grades 4-6 inthree eastern states, expressed 
on a six-point rating scale for 81 short prose selec- 
tions, each reproduced on a single page. 

28. Dale, Edgar, et al. Children’s Knowledge of 
Words, Columbus, Ohio; Ohio State Univer- 
ity Bureau of Educational Research and Ser- 
vice, 1960. 

Compilation of word lists for grades 4, 6, 8, 10, 
and 12, based upon familiarity scores of 50% de- 
rived by tests given to samples of 200 or more at 
each grade level. 

29, Daly, William E., and Lee, В. H. ‘‘Reading 
Disabilities in a Group of Mentally Retarded 
Children: Incidence and Treatment," Training 
School Bulletin, LVII (November 1960), 85-93. 

Study of incidence of reading disability, meas- 
ured as a discrepancy between reading speed and 
mental age, and effects of homogeneous versus 
hetrogeneous reading instruction for a group of insti- 
tutional mental retardates. 

30. Davis, Norris G., and Watkins, S. ''Teen-Age 
Newspaper Reading," Editor and Publisher, 
ХСІП (July 23, 1960), 13, 

Newspaper reading survey of the newspaper read- 
ing habits of 4493 Texas teen-agers. 

31. Davis, Norris G., and Watkins, S. “Тееп- 
Agers’ Image of the Press, ” Editor and Pub- 
lisher, XCIII (July 30, 1960), 14. 

Newspaper reading survey of the image of news- 
papers and newspapermen held by 4493 Texas teen- 
agers. 

32. Delacato, Carl H. The Treatment and Preven- 
tion of Reading Problems. Springfield, Tli- 
nois: Charles C. Thomas, 1959. Pp. 122. 

Presentation of case histories illustrating the 
treatment and progress of children with severe read- 
ing disabilities, based upon an analysis of develop- 
mental and neurological symptoms. 

33. Deutschmann, Paul J., and Danielson, W- А. 
‘Diffusion of the Major News Story," Jour- 
nalism Quarterly, XXXVII (Summer ) 


| 


345-55. 

Telephone interview study of the nature and speed 
of diffusion of knowledge about three major news 
stories, the six population samples totalling 844 a- 
dults in three communities in 1957-58. 

34. Diack, Hunter. Reading and the Psychology of 
Perception. New York: Philosophical Library, 
1960. Pp. 155. 

Summary and review of various researches con- 
ducted by the author and others into the comparative 
effectiveness of early phonic versus whole ap- 
proaches in teaching reading, and the assump- 
tions about the basic perceptual unit underlying each. 
35, Diener, Thomas F., and Kaczkowski, H. R. 

“Readability of Occupational Informa t ion,” 
Vocational Guidance Quarterly, IX (Winter 
1960-1961), 87-9. 

Appraisal of three different types of occupation- 
al literature by the Yoakam readability formula. 

36. Dobrin, Ruth M. “Тһе Massapecqua Story,” 
Journal of Developmental Reading, IV (Spring 
1961), 159-72. 

Description of reading program for college- bound 
high school students and its effects on College Board 
Scores and gains on the Cooperative Test of Read- 
ing Comprehension. 

37. Doehring, Donald G. , and Rosenstein, J. **Vis- 
ual Word Recognition by Deaf and Hearing 
Children, ” Journal of Speech and Hearing Re- 
search, III (December 1960), 320-6. 

Comparative study of visual recognition of let- 
ters, trigrams and words in 40 orally-trained deaf 
children and 40 hearing children, in age groups of 
nine to eleven and twelve to 16. 

38. Durkin, Dolores. ‘Children Who Read Before 
Grade One," Reading Teacher, XIV (January 
1961), 163-6. 

Progress report on two years of a study of 49 
California children who learned to read before en- 
tering school. 

39. Dyer, Dorothy W., and Harcum, Е. В. <Уіз- 
ual Perception of Binary Patterns by Pre- 
School Children and by School Children," 
Journal of Educational Psychology, LII (June 
1961), 161-5. 

Tachistoscopic study of significance of differen- 
Ces in accuracy of perception tothe right and left of 
a fixation point for 31 pre-school and kindergarten 
children and 35 children in grades one and two, with 
implications for reading. 

0. Eames, Thomas Н. “Accommodation in School 
Children Aged Five, Six, Seven, and Eight 
Years,'' American Journal of Ophthalmology, 
LI (June 1961), 1255-7. 

Analysis of near-point accommodation adequacy 
of 899 young urban and suburban children with re- 
Spect to their visual maturity for school reading 
tasks, 

41. Eickholz, G., and Barbe. R. “Ап Experiment 
in Vocabulary Development, ” Educational Re- 
Search Bulletin, XL (January 1961), 1-7, 28. 
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Analysis of the effectiveness of a se lf-teaching 
and testing vocabulary device in two experimental 
and two control seventh-grade classes in promoting 
the learning and retention of word meanings. 

42. Edelstein, Alex S., and Larson, O. М. “Тһе 
Weekly Press’ Contribution to a Sense of Ur- 
ban Community, ” Journalism Quarterly, 
XXXVII (Autumn 1960), 489-98. | 

Study of the role of a free urgan weekly newspa- 
per in a large metropolitan ar ea as a communica- 
tion system and a means of community identifica- 
tion, based upon interviews with 735 adults selected 
by area cluster sampling techniques. SeealsoLar- 
son and Edelstein (90). 

43. Edfeldt, A. W. Silent Speech andSilent Read- 
ing, Chicago: University of Chicago Press, 
1960. Pp. 164. 

Experimental study of the relationship between 
reading ability and the occurrence of silent speech 
as shown by an analysis of electromyographic гес- 
ords of 84 entering students at the University of 
Stockholm. % 

44. Educational Records Bulletin No. 78, 1960 Fall 
Testing Programs in Independent Schools and 
Supplementary Studies, February 1961. New 
York: Educational Records Bureau. Pp. 76. 

Annual summary and correlation analysis of a 
comprehensive public and private school achieve- 
ment testing program, including reading achieve- 
ment, for elementary and secondary schools. 

45. Ervin, Susan M., and Foster, G. “ТһеБеуеі- 
opment of Meaning in Children's Descriptive 
Terms, ” Journal of Abnormal and Social Psy- 
chology, LVI (September 1960), 271-5. 

Comparative study of ability of children of aver- 
age intelligence, 32 in grade one and 36 in grade six, 
to identify correctly physical attributes (heavy, big, 
strong) of objects and personal attributes (clean, 
pretty, happy) of pictured faces. 

46. Falconer, George А. “А Mechanical Device 
for Teaching Sight Vocabulary to Young Deaf 
Children,” American Annals of the Deaf, CVI 
(March 1961), 251-7. xc 

Description of effectsof five minutes of practice 
for ten days in learning 15 nouns on a Skinner-type 
teaching machine for eight six- and seven-year -old 
deaf children. 

47. Flavell, J. H., and Stedman, D. J. “А De- 
velopmental Study of J udg ments of Semantic 
Similarity, '" Journal of Genetic Psychology, 
XCVIII (June 1 , 219-93. 

A descriptive cross-sectional study of the ability 
of 160 children of average intelligence, ten boys and 
ten girls of each of grades two through nine, to 
rank-order word-pairs in terms of their semantic 
similarity according to eleven categories of logico- 
grammatical relationships. 

48. Flores, Ivan. ‘‘Methods for Comparing the 
Legibility of Printed Numerals,” Journal of 
Psychology, L (July 1960), 3-14. 

Comparison of effects of rigorous control of ex- 
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posure time and conditions upon readability of five 

type faces, including one designed to be ‘‘read’’ by 

a machine. 

49. Forseth, William J. ‘‘Does the Study of Geo- 
metry Help Improve Reading Ability?" Math- 
ematics Teacher, LIV (January 1961), 12-13. 

Comparative effects of course work in geometry, 
biology, home economics, and industrial arts upon 

reading achievement on the Schrammel-G ra y High 
School and College Reading Test by tenth grade stu- 
dents initially matched for intelligence and reading 
ability. 

50. Frostig, M., Lefever, D. W., and Whittlesey, 
J. R. B. “А Developmental Test of Visual 
Perception for Evaluating Normal and Neuro- 
logically Handicapped Children,’’ Perceptual 
and Motor Skills, XII (June 1961), 383-94. 

Development and standardization of a develop- 
mental test in-five areas of visual perception for a 
sample of 434normal children, 3-1/2 to 8 years old, 
and 71 children diagnosed as neurologically handi- 
capped, ^ 
51. Gaier, Eugene L., and Collier, M. J. “Тһе 

Latency-Stage Story Preferences of American 
and Finnish Children, " Child Development, 
XXXI (September 1960), 431-51. 

Cross-cultural study of story preferences, in- 
cluding sex differences, of 199 American children 
with an average СА of 10. 0 and 120 Finnish children 
with an average age of 11.1. 

52. Gates, Arthur 1. Reading Attainment in Ele- 
mentary Schools; 1957 and 1937. New York: 
Bureau of Publications, Teachers College, 
1961. Pp. 26. p 

Comparison of reading attainment in 1937 and 
1957 on the Gates Reading Tests. 

53. Gates, Arthur I. ‘‘Results of Teaching a Sys- 
tem of Phonics,” Reading Teacher, XIV 
(March 1961), 248-52. 

Analysis of reading progress in relation to ac- 
complishment quotients, years of phonics instruction 
and intelligence, of 267 pupils in 12 classes taught 
by anisolated phonics approachin grades 3-5 in four 
New York schools. 

54. Gates, Arthur I. ‘‘Sex Differences in Reading 
Ability," Elementary School Journal, LXI 
(May 1961), 431-4. , 

Analysis of the significance of sex differences on 
the Gates Reading Survey Test for6,646 boys and 
6, 468 girls in grades two through eight in twelve 
school systems and ten states in 1957. 

55. Ghent, Lila. ‘‘Form and Its Orientation:A 
Child's Eye View,” American Journal of Psy- 
chology, LXXIV (June 1), 171-90. 

Two laboratory experiments, involving 118 
children aged four to eight, of children's judgment 
of non-realistic, geometricfigures as upside down. 
56. Ghent, Lila. ‘‘Recognition by Children of Re- 

alistic Figures Presented in Various Orienta- 
tions," Canadian Journal of Psychology, XIV 
(December 1960), 249-56. E 


Laboratory experiment, involving 69 children 
aged three to seven, to test assumption that chil- 
dren do not have a preferred orientation for recog- 
nizing realistic figures. 

57. Ghent, Lila, and Bernstein, L. Influence of 
the Orientation of Geometric Forms on Their 
Recognition by Children," Perceptualand 
Motor Skills, XII (March 1961), 95-101. —— 

Laboratory study of ability of 40 pre-school 
children to recognize sim ple geometric forms in 
various orientations. 

58. Goetzinger, C. P., Dirks, D. D., and Baer, 
C. J. “Auditory Discrimination and Visual 
Perception in Good and Poor Readers," An- 
nals of Otology, Rhinology and Laryngology, 
LXIX (Marc $ -36. 

Controlled study comparing auditory and visual 
discrimination in 15 male poor readers and 15 male 
good readers with а CA range of 10-7 to 12-9. 

59. Goldberg, Н. K., Marshall, C., andSims, E. 
**The Role of Brain Damage in Congenital 
Dyslexia,” American Journal of Ophthalmol- 
ogy, L (October 3 -90. 

Comparison of electroencephalographic records 
of 25 severely retarded readers and 25 normally a- 
chieving readers in Baltimore County schools with 
respect to possible sub-clinical brain damage. 

60. Gowan, J. C., and Scheibel,R. W. ‘‘Improve- 
ment of Reading in Gifted Children, ’’ Educa- 
tional Administration and Supervision, XLVI 
(January 1960), 35-40. 

Results of intensive six-week remedial reading 
program with 34 experimental and 27 control tenth 
grade students in honors English classes in a sub- 
urban California high school. 

61. Greenberg, Bradley S. Additional dataon 
Variables Related to Press Freedom," Jour- 
nalism Quarterly, XXXVIII (Winter 1961), 76- 
g 


Correlation analysis of the effectiveness of four 
variables—literacy rate, per capita income, daily 
circulation per thousand, and number of daily news- 
papers—in estimating press freedom in 32non- 
Communist bloc nations according to degree of gov- 
ernment restriction in 1956-58. 

62. Groelle, M. C. “Some Results and Implica- 
tions of Reading Survey Tests Given to Edu- 
cable Mentally Retarded C hildren,’’ Excep- 
tional Children, XXVII (April 1961), 443-8. 

Comparative survey of reading ac hi evement of 
educable mentally retarded children, 971 in 1956 
and 1,179 in 1959, of elementary and high school 
age in the Oakland, California schools. 

63. Groff, Patrick 2. “Тһе Problem of Vocabu- 
lary Load in Individualized Reading, ” Read- 
ing Teacher, XIV (January 1961), 1 88-90, 
194. 

Compilation ofthe number of new words 
introduced in pre-primers and primary readers in 
12 basal reading series. 

64. Grogan, AgnesC. “Тһе School Library and the 


Able Readers: Library Channels Reading 
Tastes of Upper Grade Pupils," Chicago 
Schools Journal, XLII (October 1960), 30-3. | 

Description of three projects in a school library 
to encourage the development of taste in reading of 
superior pupils in grades 6-8 who read and passed 
tests upon a number of a ard-winning books and 
classics. 

65. Gunderson, Richard O., and  Feldt, D. A. 
“Тһе Relationship of Differences Bet ween 
Verbal and Non-Verbal Intelligence Scores to 
Achievement, " Journal of Educational Psy- 
chology, LI (June > 115-21. 

Analysis of variance study of achievement on the 
Iowa Tests of Basic Skills among four groups of 25 
fourth-grade children, each having different pat- 
terns of verbal and non-verbal scores but matched 
total IQ scores, on the California Short-Form Test 
of Mental Maturity; with supplementary data from 
teacher observation. 

66. Haag, Carl H., Sayles, D. G., and Smith, D. 
E. P. ‘‘Certificate Requirements for Reading 
Specialists, ’? Reading Teacher, XIV (Novem- 
ber 1960), 98-1 

Survey of certification requirements for reading 
Specialists in 46 states of the United States. 

67, Harootunian, Вегј. *ANote on Reading and 
Closure,"' Reading Teacher, XIV (May 1961), 
343-4. 

Correlation analysis of relationships between 
reading achievement andtests of speed and flexibil- 
Чу of closure administeredto 208 students in grades 

-8. 

68. Hartman, Frank R. “Recognition Learning 
Under Multiple Channel Presentation and 
Testing Conditions, ’’ Audio-Visual Communi- 
cation Review, IX (January-February 1961), 

Experimental study of comparative results of 
presenting information by single, dual, or triple 
modes of presentation in 43 combinations of audio, 
pictorial, and print channels in three experiments 
With 1, 184 university freshmen. 

69. Haskins, Jack B. ‘‘Title Rating: A Method 
for Measuring Reading Interests and Predict- 
ing Readership, ” Educational and Psycholog- 
ical Measurement, XX (Fall › -09. 

Experimental study involving a national sam ple 
of 497 persons, to validate the usefulness of **ther- 
mometer”’ and verbal scales in predicting actual 
readership of magazine articles by rating titles. 
Also appeared in Journalism Quarterly, XXXVII 
(Spring 1960), 224-30, and reviewed in Journal of 
Educational Research (February 1961). ~~ 
"0. Haskins, Jack B. ‘‘Validation of the Abstrac- 

tion Index as a Tool for Content-Effects An- 
alysis and Content Analysis,” Journal of Ap- 
plied Psychology, XLIV (April › -9. 

Validation study of Gillie's Abstraction Index (AI) 
against behavioral criteria of reader behavior and 
Satisfaction in reading selected portions ofan issue 
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of the Saturday Evening Post, based upon a nation- 

wide sample of 340 persons. 

71. Hemmerling, Robert L., and Hurst, H. “Тһе 
Effects of Leisure Time Activities on Scholas- 
tic Achievement, ” California Journal of Edu- 
cational Research, XII (March 1), 86-90. 

Questionnaire survey of 202 California hig 
School sophomores reporting the extent of nine lei- 
sure-time activities, including reading, and the sig- 
nificance of their correlationto academic success. 
72. Herculane, Sister M. “А Survey of the Flexi- 

bility of Reading Rates and Techniques Ac- 
cording to Purpose, ’’ Journal of Developmen - 
tal Reading, IV (Spring 1961), -10. 

Investigation of extent to which 102 eighth-grade 
pupils varied their rate and techniques of reading 
according to purpose and displayed understanding of 
the concept of flexibility in reading rate. 

73. Hermann, Knud. Reading Disability: A Medi- 
cal Study of Word- Blindness and Related Han- 
dicaps, Springfield, Illinois: Charles С. 
Thomas, 1960. Pp. 183. 

Clinical analyses of children and adults diagnos- 
ed as congenitally word-blind at the Word- Blind In- 
stitute in Copenhagen, Denmark. 

74, Hinze, Helen К. “Тһе Individual’s Word Asso- 
ciations and His Interpretation of Prose Par- 
agraphs,’’ Journal of General Psychology, 
LXIV (January 1961), 193-203. 

Chi-square analysis of nature and direction of 
Íree associative responses inthe interpretation of 
key stimulus words in two paragraphs, among 25 
freshmen English students ina California state col- 
lege. 

C Hirst, Lynne S. “Тһе Usefulness of a Two- 
Way Analysis of WISC Sub- Tests in the Diag- 
nosis of Remedial Reading Problems,’’ Jour- 
nal of Experimental Education, XXIX (Decem- 
ber 1960), 153-60. 

Analysis of WISC sub-test profiles for 21 severe- 
ly disabled and 18 mildly disabled readers of nor- 
mal intelligence with CA's between 8-0 and 13-6. 
76. Hislop, George R. “А Study of Division Two 

Social Studies Reading Skills, " Alberta Jour- 
nal of Educational Research, VII (March 1961), 
УА: ПШ ОТУИ AE S 


Comparative study of 28 high versus 28 low a- 
chievers in experimental and control groups to ex- 
amine effects of a four-month program in problem- 
solving techniques in social studies re ading in 
grades four, five and six. 

77. Hoar, Jere. “Book Reading in the Senior 
Years; The Habits and Preferences of 200 
Мізвізвірріапв,”” Journal of Educational So- 
ciology, XXXIV (November 1960), 137-44. 

Questionnaire survey of reading practices and 
preferences of 200 people over age 60 ina rural 
Mississippi town. 

78. Howell, William C., and Kraft, C. L. “Тһе 
Judgment of Size, Contrast, and Sharpness of 
Letter Forms," Journal of Experimental 
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Psychology," LXI (January 1961), 30-9. 

Analysis of three experimental studies t o quan- 
tify and measure effects of image Size, blur, and 
contrast upon recognition of letters of the alphabet. 
79. Hvistendahl, J. К. ‘‘Headline Readability Mea- 

sured in Context, ? Journalism Quarterly, 
XXXVIII (Spring 1961), -8. 

Comparison of speed of reading by 80 college un- 
dergraduates of two groups of 50 one-column, 
three-line headlines of the same point size placed 
in normal context, one group of headlines set in 
capital and lower case letters, andtheother in low- 
er case letters only. 

80. Ingram, Vivien. ‘‘Flint Evaluates Its Primary 


Cycle," Elementary School Journal, LXI(No- 
vember 1960), 76-80. 

Uncontrolled study of reading achievement of 68 
third-grade pupils in an ungraded primary in com- 
parison with 3651 other graded pupils in a Michigan 
city. 

81. Jones, Reginald L., Grass, F. P., and Van 
Why, Е. L. “А Longitudinal Study of Reading 
Achievement in a Group of Adolescent Institu- 
tionalized Mentally Retarded Children," 
Training School Bulletin, LVII (August 1960), 
41-7. 

Growth in reading achievement shown by analy- 
sis of variance on California Reading Test, by 42 
mental retardates, aged 16-18 years, inan Ohio in- 
stitution. 

82. Kallos, George L., Grabow, J. M., and Guar - 
ino, А. “Тһе WISC Profile of Disabled Read- 
ers," Personnel and Guidance Journal, 
XXXIX (February 1961), 476-8. 

Analysis of the significance of differences in WISC 
subtest scores of 37 boys, aged 9-14 and with IQ's 
of 90-109 to determine the value of the WISC pro- 
files in predicting reading disability. 

83. Kearl, Bryant, and Powers, В. D. ‘‘Estimat- 
ing Understanding of Scientific Terms," Jour- 

nalism Quarterly, XXXVIII (Spring1961), 221 
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Comparison of difficulty ratings ofa test of soils 
terminology or knowledge by 102 Wisconsin farmers 
and ratings by groups of 39 soil res earch special- 
ists, 92 soils extension workers, and 108 agricul- 
tural journalists in several states. 

84. Kerfoot, James F. “Тһе Vocabulary in Pri- 
mary Arithmetic Texts,” Reading Teacher, 
XIV (January 1961), 177-80. Ps 

Compilation of a first- and second-grade arith- 
metic word list from six current arithmetic text- 
book series, coded to the Gates and the Dale pri- 
mary vocabulary lists. 

85. Kolson, Clifford J., and Fullmer, W. “А Be- 
ginning Reading УосарШагу,”” Journal of Ed- 
ucational Research, LIV (March 1961), 270-2. 

Tabulation of a basic word list of 184 words ac- 
cording to frequency in 11 basal readers published 
jetween 1954-59 and compared to the Dolch and In- 
ernational Kindergarten Union lists. 


86. Koppitz, Elizabeth M., Mardis, V., and Stephens, 
T. “A Note on Sc т eening School Beginners 
with the Bender Gestalt Test, ’’ Journal of Ed- 
ucational Psychology, LII (April T 1961), 80-1. 

Correlations at the beginning and end of first 
grade scores of 272 pupils of the Bender Gestalt 
Test with either the Lee-Clark or Metropolitan 
readiness test scores, to determine the predictive 
value of the Bender Gestalt test. 

87. Kowitz, G. T., and Armstrong, C. M. “The 
Effects of Promotion Policy on Academic A- 
chievement, ” Elementary School Journal, 
LXI (May 1961), 

Comparative effects upon reading and arithmetic 
achievement in a school with a high retention rate 
and one with a low retention rate as indicated by 
longitudinal channels of developmental data. 

88. Lachmann, Frank M. ‘‘Perceptual Motor De- 
velopment in Children Retarded in Reading 
Ability," Journal of Consulting Psychology, 
XXIV (October -31. 

Comparative study controlling reading ability and 
intelligence of performance on the Bender Gestalt 
Test of 40 retarded readers, 40 normalreaders; 
and 40 emotionally disturbed but norm al readers 
from ages 8-12. 

89. Larson, Howard L., Black, "p. B., and 
Browne, M. P. “An Evaluation of an Exper- 
imental Method of Teaching Language to Low 
Ability Grade X Students, ’’ Alberta Journal of 
Educational Research, (September 
-75. D 

One-year comparative study of the improvement 
of low language achievers in five grade ten Canadi- 
an schools organized into four experimental homo- 
geneous groups and four heterogeneouscontrol 
groups, upon the Iowa Tests ofGeneral Vocabulary, 
Interpretation of Literary Materials, and Correct- 
ness of Expression. 

90. Larson, Otto N., and Edelstein, A. 5, ‘‘Come 
munication Consensus and the Community 
Involvement of Urban Husbands and Wives," 
Acta Sociologica, V (1960), 15-30.. 

Area cluster sampling study of readership orien- 
tation to a free urban weekly n e ws paper and to its 
effects upon community orientation determined by 
independent interviews of 267 husband-wife pairs in 
alarge metropolitan area. See also Edelstein and . 
Larson (42). 

91. Legere, JohnC. L., and Tracey, W. R. 
“Reading Improvement in an Army Service 
School," Journal of D e ve lopmental Reading, 
IV (Autumn , 41-6. 

Descriptions of methods and results of a 20-hour 
reading improvement program at a U. S. Security . 
Agency Training Center and School. 


92. Lyle, Jack. ‘‘Semantic Differential Scales for 
Үдере» Research, ’? Journalism Quarter- - 
ly, XXXVII (Autumn 1960), -62, 
Description of the derivation of 20 semantic ‘dif- 
ferential scales to measure attitudes toward a news" 


paper and the face validation of the scales against 

the Stanford test of newspaper attitudes. 

93. Lynn, Richard. **Individual Differences іп In- 
troversion- Extraversion, Reactive Inhibition, 
and Reading Attainment, ”?” Journal of Educa- 
tional Psychology, LI(December 1960), 318-21. 

Study involving tests of reminiscence and atten- 
tion given to 82 English children aged 8-11, to de- 
termine if reading attainment is significantly corre- 
lated with introversion andthe tendency to generate 
reactive inhibition slowly. 

94. Lytton, H. “Ап Experiment in Selection for 
Remedial Education,'' British Journal of Ed- 


ucational Psychology, XXXI (February 1961), 
10-94, =з. Sateen 


Predictive study of the relative merit of select- 
ing 97 elementary pupils for remedial reading on a 
test-result or teacher-judgment basis as shown by 
average gains in three *fadjustment" centers in 
England. 

95. McConville, Carolyn B. “Напдейпевв and Psy- 
chomotor Skills,’’ Jour nal of Developmental 
Reading, IV (Autumn , 41-52. 

Analysis of significance of difference in perfor- 
mance of 110 introductory psychology students, 
classified reSpectively as right or left handed, 
ambidextrous, and functionally mixed dominant, up- 
оп tests or reading rate and comprehension, adapt- 
ability and reaction time. 

96. McCracken, Robert A. “Accelerating the 
Reading Speed of Sixth Grade Gifted Child- 
геп,” ExceptionalC hildren, XXVII (Sep- 
tember › 21-8. 

Results of accelerated Speed reading program 
emphasizing the «responsibility required of a mature 
reader” for eight sixth-grade girls of 120 IQ or 
better, 

97. MacGinities, Walter H. ‘(Contextual Con- 
¿straint in English Prose Pas sages, ’’ Jour- 

nal of Psychology, LI (January 1961), 121- 
390 


Effect of omission of words (singly, inpairs, and 
in groups;of four) at different intervals in two short 
Passages of literary prose upon the ability of 600 
College students to restore the omitted words. 

98. McGinnis, Dorothy J., et al. ‘Instructional 
Problems in Reading as Viewed by Teachers 
and Administrators,’’ Reading Teacher, XIV 
(November 1960), 75-80, 114. 

Analysis of inventory completed by 549 elemen- 
tary teachers and 54 schooladministrators in Mich- 
1gan concerning problems encountered in the teach- 
ing of reading and the kinds of in-service training 
Programs provided by their schools. 

39. McGuire, William J. “ Cognitive C o nsistency 
and Attitude Change," Journal of Abnormal 
= Social Psychology, LX (May 1960), 345 
-53. А 

Study of shifts in consistency of opinions on sev- 
eral logically related issues by 120 college students 
asa result of reading persuasive statements pre- 
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sented by a Socratic method of argument and subse- 

quently read on one of these issues. 

100. McGuire, William J. ‘Direct and Indirect Ef- 
fects of Dissonance-Producing Messages, ’’ 
Journal of Abnormal and Social Psychology, 
LX (May я = 

Study of immediate and delayed retention effects 
among 35 college students in consistency of opinion 
on issues, before and after reading persuasive 
messages, as assessed by truth ratings upon rele- 
vant syllogisms with desirable andundesirable 
premises. 

101. MacKinnon, A. R. How Do Children Learn to 
Read? Toronto: Copp Clark Press, 1959. 
Pp. 276. 

Analysis of adetailed experimental study of ef- 
fectiveness of beginning reading materials designed 
by І. A. Richards and C. M. Gibson to control lang- 
uage patterns and used by a small number of five- 
year-old individuals and groups in Edinburgh, Scot- 
land. 

102. McRobbie, K. A. “Тһе Toronto New Castle 
Textfilm Reading Experiment,'' Ontario 
Journal of Educational Research, IIT (April 

j í 

Comparison of effectiveness of textfilms and 
conventional reading materials for 352 first-grade 
pupils in ten classes in five Toronto schools. 

103. Major, Alexander G., and Collette, A. T. “Тһе 
Readability of College General Biology Text- 
books," Science Education, XLV (April 
1961), 216-24, 

Analysis of vocabulary load, comprehension 
level, and readability of the ten most used college 
biology textbooks in 1953-54, accordingtothe Flesch 
reading ease formula, 

104. Manis, Melvin. “Тһе Interpretation of Opinion 
Statements as a Function of Recipient Atti- 
tude,” Journal of Abnormal and Social Ps 
chology, LX (May 1960), 340-1. 

Investigation of the relationship between differ- 
ences in reader attitude of 70 psychology students 
toward a topic (fraternities) and interpretation of 
opinions expressed in statements pro, anti and neu- 
tral to this topic. 

105. Martens, Mary. “Тһе Role of a Pacer in Im- 
proving Comprehension, ” Journal of Devel- 
opmental Reading, IV (Winter 1961), 135-9. 

Variance analysis of improvement and retention 

in comprehension skills in a controlled study of 24 

experimental and 24 control junior high school sub- 

jects over a school year, the experimental subjects 
using a reading pacer twice a week for ten-minute 
periods. 

106. Mazurkiewicz, Albert J. ‘‘Social-Cultural In- 
fluences and Reading, ’’ Journal of Develop- 
mental Reading, III (Summer 1960), 254-63. 

Study of the correspondence of classification by 

157 eleventh-grade boys and their fathers of reading 

as **mostly masculine" or ‘‘mostly feminine" and 

in relation to the sons’ reading ability and curricu- 
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lum choice. 

107. Mersand, Joseph. Attitudes Toward English 
Teaching, Philadelphia: Chilton Company, 
1961. Pp. 363. 

Detailed analysis of free-response questionnaire 
replies from 1250 persons representing educational, 
business, press, legal, librarian, governmental and 
other attitudes toward the present status and future 
needs of the teaching of English. Includes some 
references to reading. 

108. Minsel, Clara. “ Audio-visual Materials and 
Fifth Grade Reading Achievement, ”” Science 
Education, XLV (February 1961), 86-8. 

Controlled nine-month study of the effects of ex- 
tensive vs. no use of audio-visual aids in two fifth- 
grade classes of 25 pupils each, upon reading a- 
chievement on the Gates Basic Reading Tests and 
upon number of books read. 

109. Morgan, Elmer F., Jr. and Stucker, G. R. 
“Тһе Joplin Plan of Reading vs. а Tradi- 
tional Method, " Journal of Educational Psy- 
chology, LI (Apri Е 

Comparative study of the effectiveness ofthe 
Joplin plan in grades five and six for four experi- 
mental and four control groups matched as high a- 
chievers and low achievers in reading. 

110. Meuhl, Siegmar. “Тһе Effects of Visual Dis- 
crimination Pretrainingon Learning to Read 
a Vocabulary List in Kindergarten Children,” 
Journalof Educational Psychology, LI (Aug- 
ust 1 5 -21. 

Discriminationtraining experiment with 37 kin- 
dergarten pupils in learning to read a vocabulary 
list as a result of matched pretraining in geometric 
forms, different words, orthe same words appear- 
ing on the list. 

111. Neville, Donald. “А Comparison of the WISC 
Patterns of Male Retarded and Non-Retard- 
ed Readers,"' Journal of Educational Re- 
search, LIV (January 1961), zm. 

Comparisonof the significance ofthe differences 
in WISC subtest scores for 35 male retarded read- 
ers and normal readers matched by total IQ and 
grade level. 

112. Noall, Mabel S. ‘‘Automatic Teaching of Read- 
ing Skills in High School," Journal of Educa- 
tion, CXLIII (February 1961), 1-73. 

Two studies of significance of gains through (a) 
**massed differentiated,” and (b) self-directed vs. 
teacher-directed r eading skills instruction among 
able high school students. 

113. North, Robert D. *An Appraisal of the SRA 
Tests of General Ability Based on Independent 
School Results,” Educational Records Bulle- 
tin, No. 78 (February 1 , 68-76. 

Correlations between TOGA IQ's and scores on 
reading tests in grades 2,5, 7,9, and 12. 

114. Pace, Zietta S. ‘‘Flesch Formulas Applied to 

Current Readings,’’ Psychological Record, 
XI (April 1961). 
Analysis of reading difficulty and human interest 


factors, according to Flesch readability formulas, 

in samples from five current books of readings in 

psychology. 

115. Pallone, Nathaniel J. ‘‘Effects of Short- and 

Long-Term Developmental Courses upon S. 
А.Т. Verbal Scores,"' Personnel and Guid- 
ance Journal, XXXIX (April 1961), 654-7, 

Effects of intensive six-week and six-month de- 
velopmental reading programs upon the verbal part 
of the Scholastic Aptitude Test among twelfth-grade 
boys in an independent preparatory school. 

116, Parsley, К.М. Jr., and Powell, M. ''Rela- 
tionships Between the Lee-Clark Reading 
Readiness Test and the 1937 Revision of the 
Stanford Binet Intelligence Test, Form L,” 
Journalof Educational Research, LIV (April 
1961), 304-7. 

Correlation analysis of the relationship between 
Binet IQ’s and Lee-Clark Reading Readiness Test 
scoresfor 169 entering first-grade pupils whose 
readiness test scores indicated that reading instruc- 
tion should be delayed from 3-12 months. 

117. Porterfield, O. V., and Schlichting, H. F. 
“Peer Status and Reading Achievement,” 
Journal of Educational Research, LIV (April 


“ 


А б 
Chi-square analysis of relationships between 
peer status and reading achievement among 981 sixth 
grade pupils from families of high, middle, and low 
socio-economic status, enrolled in 15 schools. 

118. Poulton, E. C. “British Courses for Adults on 
Effective Reading, '' British Journal of Educa- 
tional Psychology, XXXI (June 1961), 128-31. 

Survey of percent of gain in reading rate and 
comprehension in adult reading programs in England 
between 1952 and 1959, chiefly with Harvard reading 
materials. 

119. Powell, Marvin, апа Parsley, K. “Тһе Rela- 
tionship Between First Grade Reading Read- 
iness and Second Grade Reading Achieve- 
ment, ” Journal of Educational Research, 
LIV (February j -33. 

Correlation between first-grade scores on the 
Lee-Clark Reading Readiness Test and beginning | 
second-grade scores on the California Reading Test 
for 703 children in schools in an Ohio city. 

120. Ravenette, A. Т. “Ап Empirical Approach to 
the Assessment of Reading Retardation Vocab- 
ulary Level and Reading Attainment," British 
Journal of Educational Psychology, 
February › 96-103. : 

Application of the Crichton Vocabulary Scale, in 
which words are presented orally for definition 10 
lieu of IQ tests, to predict reading achievement in 
92 children selected randomly in grades 1, 2 and 
in two English schools. К 
121. Robinson, Helen M. ‘‘Summary of Investiga- 

tions Relating to Reading, July 1, 1959to June 
30, 1960,” Journal of Educational Research, 
LIV (February 1961), 205-20. 


Annotated bibliography and summary ofa BEE 


research related to reading. 

122. Roman, Melvin. Reaching Delinquents Through 
Reading, Springfield, Illinois: Charles С. 
Thomas, 1957. Pp. 125. 

Comparative clinical study of the effectiveness of 
three approaches— group remedial reading, tutorial 
group therapy, and intensive group therapy—in im- 
proving reading achievement in21 male delinquents, 
aged 13-16, who were severely disabled readers. 


123. Roth, Robert M. “Тһе Role of Self-Concept 
in Achievement, " Journal of Experimental 
Education, XXVII (June 1959), -81. 

Analysis of relationships between the self-con- 
cept and achievement in a voluntary readingim- 
provement program, using rate-training films, for 
45 male and nine female university freshmen class- 
ified as Improver, Non-Improver and Attrition 
groups. 

124. Rothrock, Dayton C. “Heterogeneous, Homo- 
geneous, or Individualized Approachto Read- 
ing?' Elementary English, XXXVIII (April 
1961), 233-5. 

Analysis of co-variance study, controlling intel- 
ligence, previous reading achievement, and Sex, of 
comparative reading gains in one year on the Iowa 
Every Pupil Test of Basic Skills in reading compre- 
hension and work study skills for 186 fourth and 
fifth grade pupils grouped into three instructional 
patterns of four classes each. 

125. Schramm, Wilbur, Lyle, J., and Parker, E. 
B. Television in the Lives of Our Children. 
Stanford, California: S ord University 
Press, 1961. Pp. 334. 

Analysis of the effects of televiewing upon child- 
ren and upon certain aspects of their use of print in 
1958-t0, based upon eleven research studies in ten 
American and Canadian communities and involving 
data from 5991 children, 1958 parents, and several 
hundred members of school staffs. 

126. Seigler, Hazel G., andGynther, M. D. **Read- 
ing Ability of Children and Family Harmony," 
Journal of Developmental Reading, IV (Autumn 

› 17-24, 

Chi-square analysis of ratings of self, spouse, 
child and ideal on the Leary Inter personal Check 
List by ten parents of poor readers and ten parents 
of good readers matched on selected socio-economic 
factors, 

127. Shores, J. Harlan. “Are Fast Readers the 
Best Readers ?— A Second Report, ” Elemen- 
tary English, XXXVIII (April 1961), 236-45. 

Correlation analysis, involving 46 sixth- grade 
Pupils and 51 advanced undergraduate and graduate 
university Students, of relationship among various 
tests of reading speed and comprehension, including 
three tests devised by the author to measure the 
Teading of science materials. 

128. Shores, J. Harlan. “Reading Science Mater- 
erials for Two Distinct Purposes," Elemen- 
tary English, XXXVII (December 1960), 546- 
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Correlation analysis of general and specific 
reading abilities involved in reading science mater- 
ials for two distinct purposes by 46 s ixth-grade 
children. 

129. Shores, J. Harlan. ‘‘Reading of Science for 
Two Separate Purposes as Perceived by Sixth 
Grade Students and Able Adult Readers, ” El- 
ementary English, XXXVII (November 1960), 
461-8. 

Analysis of questionnaire on reading habits used 
in reading science materials for two different pur- 
poses by sixth-grade pupils, and by their projected 
‘ideal’ reader, in comparison with able college 
readers. 

130. Skapski, Mary. ‘‘Ungraded Primary Reading 
Program: An Objective Evaluation, ”” Elemen- 
tary School Journal, LXI (October 1960), 41-5. 

Comparison of reading achievement by intelli- 
gence levels and schools, of pupils in an ungraded 
primary program with arithmetic and reading a- 
chievement in graded classes atgrades 2 and 3 in a 
Vermont school system. 

131. Smilansky, Sarah. ‘‘Evaluation of Early Edu- 
cation, in Kindergarten and Grade I and II of 
Elementary School,'' UNESCO Educational 
Studies and Documents, No. 42, Paris; UN 
ESCO (1 , 8-17. 

Comparative study of the relationship of two 
types of parental socio-cultural backgrounds, cul- 
turally advanced and culturally underdeveloped, up- 
on reading and arithmetic achievement of Israeli 
primary children; and the adjustment of these child- 
ren to success or failure. 

132. Smith, Edgar A. “Devereux Readability In- 
dex," Journal of Educational Research, LIV 
(April 1 2 -303. 

Description of the development, validation, and 
and computation of the Devereaux Readability Index. 
133. Smith, Donald E. ‘Serum Correlates of Per- 

ceptual Errors in Problem Readers,"' Per- 
ceptual and Motor Skills, II (December 1960), 
S OEE ER WS Van 


Biochemical test patterns characterizing eight 
males, aged 14 to 18, who were severely retarded 
readers. 

134. Smith, Lois, and Becker, J. ‘‘Self-Selection 
with Intermediate Children,” Reading 
Teacher, XIV (November 1960), 83-8. 

Effectiveness of a year and one-half individual- 
ized reading program for 20 children in the upper 
intermediate grades in a Wisconsin city. 

135. Smith, May L., and Eno, I. V. “What Do 
They Really Want to Read?" English Jour- 
nal, L (May 1961), 343-5. 

Analysis of response to questionnaire given to 
510 Iowa students in grades 7-12 to discover what 
students would like to have an author put into a 
**story-to-order'' request from them. 

136. Snider, J. G. ‘‘Achievement Test Perform- 
ance of Acculturated Indian Children, "' Al- 
berta Journal of Educational Research, VII 
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(March 1961), 39-41. 

Comparative performance on the Coo erative 
Testof Reading Comprehension and other tests of 
41 acculturated Nezperce and 79 white students in 
grades 10-12 in an Idaho school. 

137. Sommers, Ronald K., etal. *Effects of 
Speech Therapy and Speech Inprovement 
upon Articulation and Reading,"' Journal of 
Speech and Hearing Disorders, XXVI (Feb- 
ruary › 27-38. 

Series of studies involving 1562 first-grade 
children of average range of intelligence, compar- 
ing the effects of speech therapy and speech im- 
provement programs upon reading and articulation 
in speech defectives and in children with normal ar- 
ticulation. 

138. Spache, George, Standlee, L., and Neville, D. 
**Results of Three College Level Remedial 
Reading Procedures,"' Journal of Develop- 
mental Reading, IV (Autumn 1960), 12-16. 

Co-variance analysis of effects of each of three 
six-week college remedial reading problems (а 
workbook, audio-visual or individual emphasis) up- 
on reading rate, vocabulary, com prehension, and 
attitude toward reading among 98 students іп a 
Florida university. 

139. Stewart, Daniel К. “Com parison of the Re- 
sults of a Logical Questionnaire with Read- 
ing Comprehension, Verbal Ability, and 
Grade Point Averages, ” Journal of Psychol- 
ogy, LI (April 1961), 289-92. 

Analysis of significance of relationships between 
performance on a test of logical thinking and read- 
ing comprehension, verbal ability, and grade point 
average among university freshmen with high and 
low logical reasoning scores, respectively. 

140. Strang, Ruth. ‘‘Preparation for the Teacher 
of Reading, ’’ Journal of Developmental 
Reading, IV (Autumn 1 › 53-7. 

Analysis of questionnaire replies from 77 col- 
leges and universities concerning the number and 
type of courses on the teaching of reading at the 
high school and college level. 

141. Tannenbaum, Percy H., and Lynch, М. D. 
**Sensationalism: theConcept and its Mea- 
surement, ’’ Journalism Quarterly, XXXVII 
(Summer 1960), 381-92. 

Controlled validation study of a Sensa ionalism 
Index (Sendex), using the semantic differential tech- 
niques for measuring connotative meaning, by un- 
dergraduate student ratings of three specially pre- 
pared versions of three news stories. 

142. Turner, Carla S. ‘‘Improving Selection of Pu- 
pils for Remedial Reading: A Report of Re- 
search, " English Journal, L (January 1961), 
23-33. 


Description of the grouping and progress of 73 
poor readers in seven groups, primarily according 
to intelligence and motivation, in a year's study in 
а Florida junior high school. 

143. Walker, Frederic R. “Evaluation of Three 
Methods of Teaching Reading, Seventh 
Grade,"' Journal of Educational Research, 
LIV (May 1961), -8. 

Controlled comparative study among three groups 
of seventh grade students, 86 in all, of effects of a 
six-week period of highly individualized instruction, 
individualization through the SRA Reading Labora- 
tory, and a non-individualized approach. 

144, Webster, Jane. *Using Books to Reducethe 
Fears of First Grade Children," Reading 
Teacher, XIV (January 1961) 159-62. 

Analysis of the effects of bibliotherapy in reduc- 
ing fear of the dark and fear of dogs in 80 first- 
grade children. 

145. Williams, Phillip. “Тһе Growth of Reading 
Vocabulary and Some of Its Implications, 
British Journal of Educational Psychology, 
XXXI (February 1961), 104-5. 

Comparison of usefulness of a vocabulary scale 
based upon a random sampling of the dictionary ver- 
sus an age scale in showing vocabulary progress in 
216 students aged 6-15 years in English schools. 
146. Williams, William G. “Тһе Adequacy and 

Usefulness of an Objective Language Scale 
When Administered to Elementary Scool 
Children, ” Journal of Educational Research, 
LIV (September > 30-3. 

Very brief discussion of a newly developed lan- ` 
guage scale in relation tolanguage achievement, in- 
cluding reading, and to teacher and speech clinician 
judgments of 140 children in grades K-6. 

147, Wolfson, Bernice J. ‘‘What Do Children Say 
Their Reading Interests Are?" Reading 
Teacher, XIV (November 1960), 81-2,111. 

Analysis of a 120-item reading interest inventory 
administered orally to 1000 girls and 942 boys in 
grades 3-6 in a Connecticut school system. 

148. Yandell, Maurine D., and Zintz, M. V. ''Some 
Difficulties Which Indian Children Encounter 
With Idioms in Reading,'' Reading Teacher, 
XIV (March 1961), 256-9. 

Comparative study of significance of differences 
in performance onan idioms test by 516 sixth-grade 
children representing Anglo, Spanish-American, 
Zuni, and Navaho ethnic groups in a New Mexico 
county and a control group of 390 Anglo children. 
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The Effects of Purpose on the Reading of Expositroy 
Mathematical Materials in Grade Eight’ 


Introduction 


TEST MAKERS have long given their attention to 
the measurement of speed and comprehension in 
reading. These measures have been summarized and 
the resultant measure has been called an estimate of 
a person's general reading ability. The existence of a 
general reading ability and, consequently, the ade- 
quacy of its measure beyond the elementary grades 
has been questioned. The critics of the conceptof 
general reading ability as ability to read all kinds of 
material have presented the argument that asthe de- 
mands for reading become more diversified some 
Skills must take precedence over others according 
tothenature ofthe reading task. Thus, they say that 
general reading ability is a fiction and there are many, 
not one, reading abilities. It is their contention that 
ability to read is influenced by a number of factors, 
many of which are slighted or totally ignored in the 
construction of general reading tests. 

The study presented here** attempted to m eet 
Some of the criticisms that have been directed toward 
general reading tests. First, thegeneral reading a- 
bilitytests have been criticized in their selection of 
content. The materials forthis study were selected 
for their relation to one subject matter area--math- 
ematics. A further refinement was the limiting of the 
materialsto expository mathematical material. Sec- 
ond, general reading tests have been criticized for 
basing their scores on relatively brief passages. The 
passages selectedfor this study were of more than 
One paragraphinlenth. They ranged from 174 to 444 
Words intotallenth, averaging 326 words. Third, gen- 
eral reading tests have not consistently considered the 
reader's purpose for reading. The instructions for 
reading the passages included in this study set the 
Purpose for which the passage was to be read-- (1) to 
answer a specific questionor(2) to get the main idea 
of the passage. A fourth criticism has been that the 
Teader’s background of information about the topic 
being discussed has caused the scores on the test to 
give an inaccurate picture of his ability to read. The 
topics of the materialsselected for use in this study 
Were not found in the arithmetic texts used in 
the school system in which the study was made. 
а inns m Ico 
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Purpose of the Study 


This study was designed to investigate relation- 
ships within and between measures of ability to read 
expository mathematical material and between each 
of these measures and measures of general reading 
ability and of arithmetic achievement. 


Testing Instruments 


Since there was no commercially produced test 
that gave a reading score on expository mathematical 
material itwas necessary to construct one. Тһе de- 
tails of its construction have been reported else- 
where. 

. The measures of general reading ability were 
scores on the Iowa Silent Reading Tests? and the 
California Achievement Tests, 3 reading section. 

e measures of arithmetic achievemeht were 
ScoresontheCalifornia Achievement Tests, arith- 
metic section. 

The measures of intelligence Were non-language 
mental ages on the California Test of Mental Matur- 


Experimental Procedure 


This was a two-group experiment. Group А, 
twenty-two eighth grade students, read twenty pas- 
sages to answer one specific question about each 
passage. Group C, twenty-three eighth grade stud- 
ents, read the same twenty passages to find the main 
idea of eachpassage. Group A and Group С: had been 
equated on the basis of Iowa Silent Reading Tests 
scores. 

One exercise was completed on each of twenty 
successive school days. Essentially the same pro- 
cedure was followed each day. First, the instruc- 
tions for completing the exercise were read. After 
asking for and answering questions about procedures, 
the instructor started timing at the word “ро”. Time 
was recorded in five second intervals by increasing 
a time indicator by one number every five seconds. 


ў This study was supported by a research grant from Spencer Press, Incorporated. 
* Submitted in partial fulfillment of the requirements for the Doctor of Educationin 


the Graduate College of the Univeristy of Illinois. 
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When the student completed readingfor his purpose, 

he recorded the number showing onthe indicator. 
He then wrote or selected an answer, depending upon 
the purpose for which he wasreading. Finally, each 
student completed a checklist describing his method 
of reading the passage. 


Comparisons Using Speed and Accuracy Scores 


The findings of the study as they relate to six- 
teen hypotheses concerning speed and accuracy are 
found in Table I (See next page).The findings were 
grouped about related hypotheses in order to facili- 
tate their presentation. Unless designated otherwise, 
references to speed and accuracy are to the scores 
for speed and accuracy. derived from the tests of 
ability to read expository mathematical material. 

The differences between speed scores for the two 
groups (Exercises A- Group A and Exercises C- 
Group C) and with the differences between accuracy 
Scores for the two groups were tested by using the 
t test of significance for equated groups. 5 The re- 
‘sults are reported in Table I. 

The greater mean score for Group C indicates 
that it took more time to read to find the main idea 
than to answer a specific question, The significance 
level of .01(1%) means that a difference between 
Scores as great as 51.57 can be expected to occur 
one time out of one-hundred if only chance factors 
are operating. Thegreater mean scorefor Group A 
for accuracy indicated that, as a group, those who 
readto answer a specific question were more suc- 
cessful thanthose who read to find the main ideas. 
The significance of the difference between the means 
of the accuracy scores is not reported since it was 
not possible to show that the two tests of accuracy 
were comparable. 4 

The relationships between the speed and accuracy 
Scores andthe scores on general reading ability tests 
were investigated by computing the product-moment 
coefficients of correlation between sets of scores. 

These coefficients are influenced by the unrelia- 
bility of the scores used in their computation. Тһе 
coefficients (Fxty) that would be obtained if the meas- 
uring instruments constructed for use in this study 

were perfectly reliable can be estimated by correct- 
ing the actual scores for attenuation. The influence 
of intelligence on the scores was also investigated 
by finding the partial coefficient of correlation (*xyz) 
with intelligence ‘‘partialled ош?” The coefficients 
of correlation, the corrected coefficients, and the 
partial coefficents are reported in Table II. 

The coefficients show that there are relationships 
between scores. The students who read the exposi- 
tory mathematical material faster and with greater 
accuracy tended to be higher scores on the general 
reading ability tests. Six of the eight coefficients 
are significant at less than the 1% level. This means 
that a coefficient of such magnitude would occur less 
than one time in a hundredif the two variables were 
related. The highest correlations, -.86 and-.84 , 


~ 


г 


show a fairly definite relationship between fast read- 
ing of the mathematical material and high scores on 
general reading ability tests. The negative coeffi- 
cients are explained by the factthatthe higher scores 
for speed indicateslower reading. The comprehen- 
sion scores for the Exercise C's should be inter- 
preted with the realization that the maximum cor- 
relation possible with these scores is .80. 

An inspection of Table II shows but little differ- 
ence in the correlations with the two general read- 
ing tests. However, some of the correlations appear 
to be considerably higher for one group than for the 
other. The differences between the coefficients in- 
volving speed are significant at less than the 2% 
level. 

The corrections for all coefficients except those 
involving accuracy scores on the Exercise C's are 
rather small (. 01 to .09). The reason for the greater 
Change in the coefficients involving comprehension 
scores on the Exercise C's is the relatively lowco- 
efficients of equivalence for that set of scores. 
These coefficients are of theoretical interest only. 
Instruments with perfect reliability are not available. 
The coefficients do give a more exact indication of 
the true relationships that exist. The larger coeffi- 
cients for the Exercise C's (.80 to . 87) indicates 
a closer relationship with general reading ability 
than the Exercise A's. 

S n partial coefficients in all cases are less than 
the-coefficients derived from the actual scores 
(. 07 to . 29). The order of the coefficients іп terms 


„of magnitude is not changed. The partial coefficients 


‘Supposedly show a pure relationship between speed 
and general reading ability or between accuracy and 
general reading ability. However, in all proba- 
bility, they are underestimates of the true relation- 
ship because the intelligence scores are not **pure" 
measures of intelligence, but include other factors, 
including some reading ability. Again, as when the 
scores were corrected for attentuation, the signifi- 
cance of the differences between correlations is lit- 
tle affected. The strong relationship between speed 
on the Exercise C’s and general reading ability is 
evident as it was with the actual scores. 

The Iowa Silent Reading Tests battery includes 
two parts that provide scores somewhat similar to, 
but not exactly like, the speed and accuracy scores 
derived from the tests constructed for this study : 
They are alike in that the speed and comprehension 
accuracy scores are derived from the same pass- 
ages. Thepassages for this test are not from math- 


ematics but from science and the social studies. | 


There is one passage from each of these subject 
matter areas. The coefficients of correlation, the 
corrected coefficients, and the partial coefficients 
between the two speeds of reading and between the 
two comprehension-accuracy scores are present! 
for both groups in Table III. 

The scores on the Exercise C's аге more closely 


related tothe partscores from the Iova Silent Reat- 


ing Tests than are those on the Exercise А’. 
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TABLE I 


DIFFERENCES BETWEEN MEAN SCORES FOR SPEED AND FOR ACCURACY 


Standard Degrees Level of 
Scores on A-C Error t of Freedom ‘Significance 
Speed -51.57 19.11 2.70 41 P=.01 
Accuracy 2.13 .87 
ж ” ў ж 
á * 
TABLE II есй 


LÀ , „ 
RELATIONSHIPS BETWEEN SPEED AND ACCURACY SCORES AND 
GENERAL READING ABILITY SCORES 


——————— 


Coefficient Corrected Partial 
Exercises Scores on of Correlation Coefficient Coefficient 
о I Lu eru LT А 

A Speed -- ISRT -. 48(. 02€ P<. 05) -.49 -.26 

A Accuracy--ISRT .5Т(Р<.01) .64 .28 

с Speed--ISRT -.86(Р<.01) -.87 «алі; 2.19 

с Accuracy--ISRT .65(Р<.01) ШУМЕН су ‚52 

А Speed--CAT -.42(.05«P«. 10) -.43 -.16 

A Accuracy--CAT .73(р<.01) .82 .54 

c Speed--C AT -.84(р<.01) -.85 5.16 

с Accuracy--CAT  — .68(PC.01) .85 .58 
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TABLE III 


COEFFICIENTS OF CORRELATION BETWEEN SPEED SCORES 
AND BETWEEN ACCURACY SCORES 


Iowa Silent Reading Tests Corrected Partial 
Exercises Scores on Speed Comprehension Coefficient Coefficient 
D —————— ыс агуус пк} Анаш T——— A с түзі. еее 
A Speed -. 36(P-. 10) -.37 -.11 
с Speed -.88(P<. 01) -.89 -.82 
A Accuracy .49(. 02 P<. 05) .55 .20 
с Accuracy . 63(P<. 01) .79 .39 


TABLE IV 


RELATIONSHIPS BETWEEN SPEED AND ACCURAC Y SCORES AND 
ARITHMETIC ACHIEVEMENT SCORES 


California Corrected Partial 
Exercise Score on Achievement Tests Coefficient Coefficient 
A Speed -.34(P>. 10) -.36 -.05 
A Accuracy . 65(P«. 01) „75 .41 
с Speed -.41(.05<Р<.10) -.43 -.18 
с Accuracy . 61(P<. 01) .78 .43 
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TABLE V 


DIFFERENCES BETWEEN FIRST HALF SCORES AND LAST HALF SCORES 


qq АААААЕАААААЕЕЕЕЕЫ——_—–ь————— 


Standard Degrees of Level of 
Exercises Scores on F-L Error t Freedom Significance 
A Speed 6.86 6.97 .21 21 P205 
A Accuracy .81 .39 2.08 21 P=.05 
с Speed -2.00 5.42 .37 22 P205 
с Accuracy -.08 .52 .15 22 P205 
TABLE VI 


RELATIONSHIPS BETWEEN SPEED AND ACCURACY SCORES FOR BOTH GROUPS 


LUAM XAR 


Coefficient Corrected Partial 
Group of Correlation Coefficient Coefficient 
кш ——————————————————— 
А -.54(Р<.01 -.62 Е Eh 
с -.49(.01<Р<.02 -.62 : 
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same relationship exists for the corrected and par- 
tial coefficients. The coefficients for the Exercise 
A's are not only smaller but are significant at a 
lower level. These coefficients should be interpret- 
ed with caution, for while the part-scores on the 
Iowa Silent Reading Tests were derivedfrom two 
reading passages, the speed scores were not derived 
іп the same manner nor were the tests of compre- 
hension and accuracy alike. The negative relation- 
ship shows that the faster readers of one kind of 
material tended to be faster readers of the other 
material as well. The negative relationship results 
from the use of different scoring methods. The high 
Scorers for accuracy in reading the expository math- 
ematical material tended to be the high scorers for 
comprehension on the Iowa Silent Reading Test. The 
difference between the correlation for Group A and 
that for Group C on the speed scores is significant 
at less than the 1/2 percent level. 

The relationships between both the speed and ac- 
curacy scores and the arithmetic achievement scores 
were investigated by computing the coefficients of 
correlation, Ше corrected coefficients, and the 
partial coefficients. The findings are presented in 
Table IV. 

The same cautions should be observed in inter- 
preting these coefficients as with thosefound in 
Table III. The coefficients with accuracy scores as 
one variable are higher than those for speed. They 
are significant at the 1% level. The same relation- 
ship exists for the corrected and partial coefficients. 
Neither of thecorrelations with Speed can be accept- 
ed at the 5% level, though the speed scores on the 
Exercise C’s approached this level. 

The t test was used to test the significance of the 
differences between the first-half (Exercises 1-10) 
and the last-half (Excersises 11-20) scores for all 
four sets of scores. The results are presented in 
Table V. 

The scores on the first-half were better than the 
scores on the last-half for speed on the Exercise 
C’s and for accuracy on the Excercise A’s; the op- 
posite was true for the other two sets of scores. 
Only the difference between first-half and last-half 
scores for accuracy on the'Exerc ise A's is signif- 
icant at the 5% level. 

The relationships between speed and accuracy for 
both groups was investigated by computing the co- 
efficients of correlation. The coefficients of corre- 
lation, the corrected coefficients, and the partial 
coefficients are presented in Table VI. 

For both groups the coefficient indicates that the 
faster readers tend to be more accurate. The co- 
efficients can be accepted at less than the 1% level 
for Group A and at less than the 2% for Group С. 
The difference between the coefficients is not sign- 
ificant at the 5% level. 

The generalizations that could be made from the 
findings of this study were restricted by certain 
limiting factors: (1) limited sampling, (2) atypical 
behavior resulting from the influence of the experi- 
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mental setting, (3) lack of assurance that every stu- 
dent was reading for only one purpose, (4) deter- 
mination and comparison of accuracy score, and 
(5) no objective measure indicating equivalence of 
the content of the twenty passages. 


Conclusions 


1. The purpose for reading influences the speed 
with which the material is read. 

2. General reading ability tests measure the 
kinds of things measuredby the Exercise C's better 
than the kinds of things measured by the Exercise 
A's. Thisstatement should notbe construed to mean 
that Group C or Group Aattacked their reading tasks 
in a preferable manner. It does indicate that the 
skills or information employed by both groups may 
not be the same. Afurther indicationisthat general 
reading ability tests are better predictors of success 
in some kinds of reading situations than in others. 

3.) Those who read the expository mathematical 
material faster and with greater accuracy also tend 
to achieve higher scoreson the general reading abil- 
ity tests. The relationshipis nota perfect one, even 
when corrections for attenuation are made. This in- 
dicates thatfactors in addition to the kinds of things 
measured by general reading ability tests influence 
fast and accurate reading of expository mathematical 
material. 

4. The relationships between high scores ongen- 
eral reading ability tests and superior scores for 
Speed and accuracy are reduced when intelligence 
can no longer be considered a factor influencing the 
relationship. 

5. Speed for reading the timed passages of the 
lowa Silent Reading Testsis more like speed on the 
ExerciseC'sthanonthe Exercise A's. This is added 
indication that speed is influenced by the purpose for 
reading. 

6. Comprehension of the timed passages of the 
Iowa Silent Reading Tests is more like accuracy on 
the Exercise C's than on the Exercise A's. This is 
added indication that accuracy is influenced by the 
purpose for reading. 

7. Speed and comprehension scores on a general 
reading test are not necessarily good predictors of 
Speed and accuracy for other kinds of material read 
for other purposes. 

8. The kinds of things measured by speed and, 
toalarger extent, accuracy are factors in arithmetic 
achievement. The relationships, though reduced con- 
Siderably, exist after the influence of intelligence 
18 negated. Ability to read expository mathematical 
material Successful is by no means an accurate 
indicator of success in arithmetic as measured by 
an arithmetic achievement test. 

9. Practice withoutinstruction does not produce 
better scores for speed and accuracy over a relativ- 
ely short period of time. 

10. Thereis а tendency for faster readers to be 
more accurate when they are reading expository 
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mathematical material. The size of the coefficients 
indicates that there are many exceptions when the 
material is read for either purpose. 
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A Study of the Vocabulary Needs of a Class of Juniors and 
seniors at the Alabama State College 


THE IDEA of competence or success in course 
achievement embodies the ability to read with under- 
standing, and to read with understanding presupposes 
that the student will have at his command an abun- 
dant supply of sight words in order to perceive large 
configurations of ideas. Obviously, a knowledge of 
words is indispensable to the learner. Hence, few 
teachers, irrespective of grade level or of the par- 
ticular course taught would minimize the truism that 
‘for reflection, the comprehension of words is most 
essential, and that students with impoverished or 
inferior vocabularies seem to give little thought to 
whatever problem, issue, or subject matter which 
happens to be under consideration. 

Upon the aformentioned rationality, efforts were 
made to improve the word knowledge of juniors and 
Seniors who were enrolled inthe history of education 
classes for the Fall Quarter of 1959 at the Alabama 
State College, with the assumption that there would 


be some correspondence between the language and 
thought processes of the students thus involved. 


The Problem The following questions evolve from 
the investigation; (1) What is the vocabulary status 
of the students when measured by a standardized 
vocabulary test? (2) What area of subject matter 
represents their highest achievement in vocabulary? 
(3) In what subjectarea is there limited word knowl- 
edge? (4) Dothe major and minor curriculum choices 
affect the students' vocabulary achievement in the 
areas specified on the test? (5) Was noticeable im- 
provement made in a knowledge of words following 
а period of instruction in which word study was 
assigned? (6) What was the achievement of the stu- 
dents on the teacher -made test which comprised un- 
familiar words submitted by them ? (7) To what extent 
does the teacher- made vocabulary test and the stand- 
ardized vocabulary test measure the same word 
knowledge? (8) What additional approach сап be made 
o improve the word knowledge and Subject matter 
'ompetence of the students? 


?rocedure During the first week of the Fall Quar- 
er of 1959, 124 students of junior and senior clas- 
ification presented themselves for instruction in 
he history of education, a course primarily con- 


Footnotes will be found at the end of the article. 
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cerned with the education of early people; in fact, it 
pervades the whole gamut of the culture, schools, 
curriculums, personalities and ideologies of the 
Egyptians, Hebrews, Greeks and Romans. Also, 
the course includes the influence of great movements 
upon education, such as Christianity, the Renais- 
sance, the Reformation, and Realism. 

Questions evolved from a period of general ori- 
entation and lecture. Some of these questions were 
phrased to see if the students had grasped the point 
of reference intended and to determine if the students 
understoodthe terminology used. This approach was 
poignantly motivating since the desire and plan to 
give assiduous attention toa study of unfamiliar 
words came from it. 

The protocol consisted of the following steps or 
procedures: 

l. Administering the vocabulary section (Form 
А) ofthe Diagnostic Reading Test authored by 
Frances O. Triggs and others; 

2. Assigning the students to look for the meaning 
of all unfamiliar words encountered in their 
reading, to give the source Of the word and 
the sentence in which it was used, and to sub- 
mit these word lists once each week; 

3. Administering the vocabulary section of the 
Diagnostic Reading Test (Form B) 

4. Administering a teacher-made vocabulary test 
incorporating the unfamiliar words submitted 
by the students. 

The standardized vocabulary test extends from 
grade seven through the college freshmen year. It 
has a reliability coefficient which ranges between 
-82 and . 94 based upon a random selection of stu- 
dents’ papers in various colleges. 2 Тһе total scores 
are reported to give a reliable vocabulary index. 
The specific categories are (1) General (2) English, 
(3) Mathematics, (4) Science, and (5) Social Stud- 
165. 

Тһе teacher-made test consisted of 135 words 
which the students indicated as unfamiliar. The 
Sources of the words were traceable to the text- 
books,“ to approximately 300 supplementary 
books, 100 periodicals, magazines, and bulletins. 
The frequency of occurrence of the words indicated 
their importance and cruciality, these criteria were 
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the bases for the selection of the test items. In all 
there were 6, 624 running words. 


The Results Table I presents data which show the 
achievement of 124 students on Form A of the vocab- 
ulary section of the Diagnostic Reading Test by 
Frances Triggs and others, and also the achieve- 
ment of the students who are compared with the norms 
which were established by 536 college freshmen. 

In the initial test the juniors and seniors achieved 
a median of 69.70, which placed 50 percent of them 
above this score, on the second test they reacheda 
median of 92.5 indicating a gain of 22.8 median 
points. Since the median for the freshmen who con- 
Stituted the sample is 98, it is obvious that the jun- 
iors and seniors fall short of this norm by 5. 5 median 
points. However, the improvement made as evi- 
denced in the last testing justify the value of extensive 
vocabulary study. It is noticed further, that when 
the averages are computed for total vocabulary 
achievement, as shown in Table II for Forms A and 
B ofthe standardized test, the gain in means between 
the initial and final testing is 19. 57 points; this figure 
is statistically significant at the 1 percent level of 
confidence since the “С” ratiois 6. 99. An inspection 
ofthe variability scores indicated that the students 
forthe most part were similar in their knowledge 
of words, with the exception of one or two individu- 
als whose scores fell into extreme positions of the 
distribution. 

When the separate categories are inspected as to 
achievement, it can be observed that on both forms 
of thetest, the students were limited in a knowledge 
Of words in the social studies. They made their 
lowest scores in this area; in seriatim the medians 
Were for Forms A and B, 9.19 and 13. 32; similarly 
in the same order the arithmetical averages were 
9.95 and 13. 99. Since the established norm was 20, 
the students were 6. 68 scores short of the standard 
Setby freshmen. However, the gain between the 
firstand second tests was significant, a result which 
focuses attention upon the effectiveness of word 
study. 

The results attained by the students in a knowl- 
edge of mathematical words may at first surprise 
the reader. The medians on the initial and second 
tests were 17. 62 and 17. 45, while the freshmen norm 
із 20; hence, the students are only 2.55 median 
Points below ће freshmen norm. The means for the 
mathematics vocabulary are in seriatim for both 
tests 17.74 and 17. 78. Obviously, the difference is 
Dot statistically significant at either the5 percent 
9r 1 percent levels of confidence. 

И сап be hypothesized that several factors produce 
these results. Consideration is given to afew factors 
Which seem rational at this point. It is highly prob- 
able that the students had previously invested more 
time in a study of complex mathematical terminol- 
ОБУ, since their initial score exceeded those attained 
In the other subject matter areas. Also the failure 
to make а significant gain between the first and 
Second tests may be accounted for by the good lead 


exhibited in the first test, and the absence of math- 
ematical words in the vocabulary assignment in 
which the students engaged. A final assumption is 
that there may be more unanimity in the teaching of 
mathematical concepts in the secondary curriculum 
than in the case of the social studies. Knowledge 
concerning differences which are found in the a- 
chievement of mathematics as compared with other 
subjects isnotentirely new. DeCosta? who reported 
the test performance of high school seniors disclosed 
that the mean difference between verbal ability and 
quantitative ability for 482 male seniors was -4.71 
with a standard error of .56. This difference was 
in favor of quantitative ability. The mean difference 
between verbal and quantitative ability for 757 female 
high school seniors was -2.65, againindicating that 
their performance was bettter in mathematical abil- 
ity than in verbal ability. Of course, DeCosta's 
inquiry was concerned with mathematical knowledge 
while the present study involves a knowledge of 
mathematical vocabulary. There may be some 
degree of similarity between the two findings. 

The second highest score was made in English; 
the medians were 17.40 and 21.50 on the two pre- 
viously namedtests. The norm established for Eng- 
lish is 20. It is clear that the students surpassed 
the freshmen norm by 1. 50 median points. The means 
of 17.18 and 21.02 on Forms A and B of the stand- 
ardized tests are presented in Table II. It can be 
likewise noticed here, that the gain made as a result 
of word study was highly significant for English 
achievement. 

In analyzing supplementary dataas to the number 
of students with majors and minors in the subject 
areas of the test, it was disclosed that 16 persons 
were Englishmajors, while 32 chose it for a minor; 
8 students were mathematic majors, while 10 took 
it as a minor; 6 students elected scienceas a major; 
while 2 chose it as a minor;10 students majored in 
the social studies, while 15 took it as a minor sub- 
ject. Hence, thenumber of individuals with previous 
experience іп the areas tested did not seem to 
affect seriously vocabulary achievement in every 
instance. A total of 48 people manifesting interest 
in English could have accelerated the score, while 
the sameisnottruefor the mathematics, since only 
18 people comprised this group. For social studies, 
the total for majors and minors is 25; itis recalled 
that the lowest score was made in this subject. 

The teacher-made vocabulary test consisted of 
135 words. A mean score of 60. 6 indicates that the 
students were successful with a little more than 
half of the words which were submitted as difficult 
and studied by them. These dataare shown in Table 


IH. 

The total achievement scores on Form B of the 
vocabulary section of the diagnostic test when cor- 
related with the vocabulary scores on the teacher- 
made test yielded a value of .392. This coefficient 
of . 392 gives validity to the teacher-made test since, 
according to Engle, approximately 63 percent of the 
cases are in agreement. * Therefore, the two tests 
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TABLE I 


VOCABULARY ACHIEVEMENT OF 124 COLLEGE STUDENTS FOR 1959, ON THE 
VOCABULARY SECTION OF THE DIAGNOSTIC READING TEST 
(BY TRIGGS AND OTHERS) 


Parts of the Form A Form B Norms for College 
Test ER Freshmen 
Md. Q1 Q3 Md. Q1 Q3 Md. Q1 Q3 
General 14.24 10.4 16.5 18.86 14.4 22.1 18 13 22 
English 17.40 14.2 20.5 21.50 18.4 24.4 21 17 24 
Mathematics 17.62 14.1 21.3 17.45 14.4 21.4 20 16 24 
Science 15.77 9.8 19.6 19.27 14.8 22.4 21 17 24 
Social Studies 9.19 1.5 18.0 13.32 10.2 19.0 20 17 23 
Total 69.70 55.5 83.9 92.5 75.4 105.2 98 84 112 


Number of students 124 122 536 


TABLE II 


DIFFERENCES IN VOCABULARY ACHIEVEMENT OF COLLEGE STUDENTS 
AT END OF WINTER QUARTER, 1959, ON VOCABULARY | 
SECTION OF THE DIAGNOSTIC READING TEST ! 

(BY TRIGGS AND OTHERS) 


Form A Form B 
Parts of the Standard Standard а 
Test Mean Deviation Mean Deviation ratio Degree of Freedom 
General 14.00 4.3 18.16 5.7 6.51 244 
English 17.18 4.8 21.02 4.6 6. 50 244 
Mathematics 17.74 5.6 17.78 5.2 .059 244 
Science 14.73 6.4 17.96 6.4 3.95 244 
Social Studies 9.95 7.5 13.99 5.4 4. 93 244 
Total 70.76 24.0 90. 33 20.0 6.99 244 


Number of students 


A 


ye. 
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TABLE III 
ACHIEVEMENT OF 131 COLLEGE STUDENTS 
ON A VOCABULARY TEST IN THE 
HISTORY OF EDUC ATION 
(WINTER QUARTER 1959) 


Test of 135 Words 


Median 47.0 
Qi 29.9 
Q3 83.7 
Mean 60.6 
Standard Deviation 

Range of Raw Scores 0 - 130 
Number of Students 131 


Seem to measure the same thing; in this particular 
case, they measure a knowledge of words. 

When the scores which fall іп the extreme positions 
of the distribution of the two tests are closely ana- 
lyzed and compared , the teacher-made test seems 
to be more discriminating for the upper level of stu- 
dents than for the lower level. In other words, the 
items for the better students tend to discriminate 
equally. 


Conclusions and Recommendations The results 
Seem to warrant the following conclusions: 

1. The juniors and seniors in this investigation 
surpassed the freshmen norm in general vo- 
cabulary achievement on the final test by . 86 
median points. Their median was 18. 86, while 
the norm is 18. 

2. The median achieved in English vocabulary on 
the final test was 21.50, while the test norm 
is 20. It is clear here that the difference is a 
narrow margin of 1.50 above the freshmen 
level. 

3. In mathematics vocabulary, the students were 
below the freshmen norm by 2. 55 points; they 
attained a median of 17.45 in the final test, 
while the norm for mathematics is 20. 
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4. In science vocabulary the students achieved a 
median of 19.27 while the norm is 21, hence 
they fall short of the freshmen median by 1. 73 
points. 

5. In social studies the students failed to reach 
the freshmen norm by 6. 68 median scores. 

6. In all, the vocabulary achievement of the jun- 
ior and senior students who comprised this 
study was not complimentary, since generally 
they performed no better than freshmen. 

7. The design of having students study and submit 
weekly individual lists of words paid off sig- 
nificant dividends. Only in the area of mathe- 
matics was the vocabulary gain negligible. 

8. It was observed that the students as a whole 
profited from the study of unfamiliar words 
in that their recitations were morelogical and 
better enunciated after having had the experi- 
ence. 

9. Another vocabulary study might be pursued in 
which the equivalent group technique is em- 
ployed. Also the experimental variable could 
be so designed as to give greater depth to the 
selection and study of words which are difficult 
to students. 

10. The grade for the course “ The History of 
Education,"' if correlated with vocabulary 
achievement, might have predictive value for 
subsequent instruction. 

ll. The number of words submitted by each stu- 
dent correlated with his vocabulary achieve- 
ment score mightgive corroborating evidence 
to the value of word study. 
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Junior Primary in the Van Dyke Level Plan 


THE PROBLEM of keeping children out of formal 
reading until they had reached the necessary state 
of readiness was chosen to meet a community need. 
On an individual psychological examination admin- 
istered to all kindergarten children in October, 
1953, the average IQ was found to be 99. This 
pointed up the fact that we had a high proportion of 
children whose mental ages were lower than their 
chronological ages. Also, the majority of the child- 
ren seemed not to bring an adequate experientail 
background to the reading situation. 

In September, 1943, a pilot study conducted in 
our schools researched the possibility of replacing 
the rigid grade system bya flexible plan of pacing 
the child to his own rate of speed in the reading sit- 
uation. The plan was called The Level System. 


Assumptions and Hypothesis 


1. Maturation is an individual characteristic of 
each child and his progress must be individually 
tailored to his needs. 

2. According to the normal curve of distribu- 
tion, one third of the children entering first grade 
have not reached the mentalage of six years and six 
months. 

3. A mental, social and emotional maturity e- 
quivalent to that of an average six-year-six-month- 
old child is a prerequisite for reading. 

4. Lack of adequate functionalvisual and/or au- 
ditory acuity greatly limits the chance of success 
in the reading situation. 

The hypothesis entertained was that a period of 
reading readiness between kindergarten and formal 
reading was essential for some children. 


Subjects 


The experimental group consisted ofall children 
in the district who left kindergarten in February, 
1953. The control group was composed of those 
who had left kindergarten in February, 1952. Only 
children of these original groups who were in the 
Van Dyke Schools four years after leaving kinder- 
garten were used. 


Setting up the Study 


Personnel involved were Director of Elementary 
Education, principals of ele mentary schools, psy- 
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Groups No. in Group at No. in Group at 
Start of Research EndofResearch 

Experimental 179 103 

Control 103 83 


chologist, reading diagnostician and teachers of the 
experimental group. 

АП children of the experimental group, оп leav- 
ing kindergarten, were given the Pintner-Cunning- 
ham group psychological test. If it was considered 
necessary, an individual psychological augm e nted 
this information. 

Each child's vision was screened on ће Keystone 
Tellibinocular and referral to an ophthalmologist 
was made when indicated. Allchildren were check- 
ed for emotional and soc ial maturity against check 
lists constructed by the personnel involved. Any 
child whose mental, social, or emotional age was 
less than six years and six months was placed in the 
readiness situation. The other childrenwere placed 
in Level 1 (Beginning Reading). 

To insure a complete set of records for both the 
experimental and control groups, numerous forms 
and individual record charts were constructed. 
Meetings on released time were held for teachers 
of the experimental group in order to involve them 
in planning procedures and evaluationprocesses. 
Teachers of the control group were instructed only 
on methods of keeping records. 


Procedures and Techniques 

When the experimental group left kindergarten, 
placement was made in terms of the results of the 
tests that had been administered. Children ready 
for formal reading were placed inLeveli, those 
needing more readiness than the basic series pro- 
vided were placed in Junior Primary A. The very 
immature and slow learning children were placed in 
Junior Primary B. Whenever possible, the Junior 
Primary children were placed in a room where no 
formal reading was taught. Groups were to be flex- 
ible and mobile. When the controlchildren left kin- 
dergarten, they had all been placed in Level 1 and 
followed the Level System Plan. 

There were five Reading Readiness books avail- 
able plus all the material constructed by the teach- 


ers for use in the Junior Primary. The number 
used by each child depended on his individual need. 
The reading readiness slides of the Keystone Tach- 
istoscope were also used as a basictool. Emphasis 
was placed on teaching the necessary skills of func- 
tional visual and functional auditory acuity. To 
move from Junior Primary to Level 1, a child had 
to have the mental, socialand emotional maturity of 
the average six-year and six-month-old child. He 
also had to score average or above on the Metro- 
politan Reading Readiness Test. 

At the end of two years, one of the most impor- 
tant findings of the research became apparent, 
namely, that a child might score average on a total 
Reading Readiness Test, but be in a very low per- 
centile in areas measuring functional visual and/or 
functional auditory acuity. Suchachild was doomed 
to failure on Level 6 (1A) or Level 8 (2B), although 
he might have succeeded on Level 2(Pre-Primer) 
and Level 4 (Primer). АП children in the exper i- 
mental group who had failed a test on any level had 
a diagnostic study made of alltheir previous reading 
tests. Generally, such a study uncovered the fact 
that the child had previously failed inspecific areas 
at least two books before his total score showed 
failure. 

Previously, the child was offered somerein- 
forcement of all skills and techniques at the level 
where he had failed. With this new insight, efforts 
Were made to pick out those areas where inadequacy 
was evident and to give specific help there. Discus- 
Sion of this problem at teachers’ meetings resulted 
in the cooperative construction of consonant cards 
and geometric figure cards which were to be used 
in reinforcing functional visual and functional audi- 
tory acuity. 

As the experimental group moved into formal 
reading, teachers became adept in the use of diag- 
nostic techniques. They wereable to identify caus- 
ative factors of failure. Discussion of this new in- 
Sight resulted in construction of materials and 
workbooks for reinforcing techniques introduced, 
but not permanently fixed, inthe developmental pro- 
gram. 


Findings 

Because of the disparity in the numbers in the 
experimental and in the control groups, results 
Were figured on a percentage basis. Only children 
of the control group who were inour schools in Feb- 
ruary, 1956 and those of the experimentalgroup who 
Were present in February, 1957 were used in com- 
puting the statistics. Findings are: 

1. Children in the experimental group showed 
Superior achievement. А smaller percentage of 
them scored below average and a higher percentage 
of them scored above average than did the control 
group. 
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2. Results of the Iowa Basic Reading Tests ad- 
ministered to the control group in February, 1956 
and to the experimental group in February, 1957 
Showed that 52. 875 of the experimentalgroup scored 
at or above 4A as compared to 34. 3% of the control 
group. This in spite of the fact that over one third 
of the experimental group were delayed in Junior 
Primary before they started formal reading. 

3. The greatest benefit tothe Van Dyke Plan re- 
Sulted from the fact that teachers of the experimen- 
tal group who were participating in the Action Re- 
search taught in rooms close to rooms used by 
teachers of the control group. Their enthusiasm, 
newly developed philosophy, and know ledge of new 
skills and techniques spread from teachers of the 
experimental to teachers of the control group, 
These in turn subconsciously resorted to the use of 
the same teaching methods. 

4. A hopeful finding was that educators are wil- 
ling to study any new plan that will promote child 
growth. To lend credence to this idea, seventy-two 
interested administrators and teachers visited the 
Van Dyke Schools last year. They observed the plan 
in action, These visitors came from Metropolitan 
Detroit, outstate Michigan, Canada and Ohio. 

5. A retarding factor in the progress of the pro- 
gram was the teacher who was unwilling to cooper- 
ate. There were also afew teachers who were still 
unable to recognize the opportune time to accelerate 
or slow down a child’s program. A rather preval- 
ent disadvantage was that primary teachers desire 
to **even ар” reading groups. 

6. Afinding that was both a surprise and a dis- 
appointment was the necessity for central leader- 
Ship to achieve continuing success on a highlevel. 
This might be due to the fact that teachers are re- 
tiring and new teachers are taking their places. 


Conclusions 


The findings of this research led to the following 
conclusions: 

1. Children not ready for reading profit more by 
being kept in a readiness situation until prepared for 
formal reading than by being placed in Level 1 (1st 
grade), failing and having to repeat the work. 

2. Moving children ahead into the formal reading 
program only after they succeedon the Junior Pri- 
mary Level does not delay their school progress. 
Their rate in moving ahead depends on their speed 
of maturation in mental, socialand emotional areas. 
This cannot be accelerated. 

3. Programs in which teachers cooperate in the 
planning and execution produce enthusiastic partici- 
pation on the part of teachers andis reflected in the 
attitude of the children. 

4.. Material constructed to meet specific district 
needs is more readily accepted and more frequently 
used if it is a result of group action. 
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The Extent of the Use of the Left Hand in Handwriting 


OF ALL the pupils in public schools who have 
been neglected, or subjected to inadequate or poor 
instruction, the pupil who insists uponusing theleft 
hand for handwriting, in the writer's opinion, leads 
thelist. This statement is not intended as a criti- 
cism of teachers; it is a situation brought about by 
a dearth of actual research on the one hand, and a 
literature replete with conflicting and unsupported 
opinion on the other. 

Because of continued disagreement on this prob- 
lem, the writer has undertaken this study.! It 
extended over a period of eight years and consisted 
of a review of the literature as wellas further re- 
Search to find answers to the many unanswered 
questions in this area. Because of the length of the 
work, it will be reported in two parts. The present 
paper will concern itself with the extent of the use 
of the left hand in handwriting. A later study will 
take up the relative efficiency ofthevarious ways of 
writing with the left hand. 


Previous Studies 


А review of reports on general left preference 
that have appeared in the literature through the 
years presents a picture of wide disagreement in 
findings. To illustrate, itwas foundthatthe percent 
of persons with general left preference ranged from 
2 to 29.7. Findings based upon a combination of 
left-hand activities ranged from 2.8 percent to 11. 6 
percent. Use of the left hand in handwriting ranged 
from 2 to 6.4 percent. Left-eyedness has been re- 
ported to be 25 and 29. 7 percent. Left hand advan- 
tage in strength of grip is reported to be 28.8 per- 
cent. Blau,2 after an extensive review of there- 
Search, concluded that ‘‘when we start dealing with 

figures on the incidence ofsinistrality, we cannot 
find a single one that is authoritative. ”’ 

Similar, but not so great, disagreement is found 
in the literature regarding the extent of left-handed- 
ness in writing. Estimates by educators, psychol- 
ogists, and others, range from 4 to 8 percent. Ap- 
parently more boys than girls prefer the left hand. 

Disagreement in findings may bethe result of one 
or more of these factors: 

1. Intesting method employed, as for example, 
use of the left hand in various activities, strength of 


* Footnotes appear at the end of the article. 
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grip, dexterity, measurement of bones, carrying 
hand, throwing hand, writing hand, hand clasping, 
tapping. 

2. In phase of sidedness tested— handedness, 
eyedness, leg and foot preference, ear preference, 
various combinations. 

3. Inthe group tested— normal, abnormal, sub- 
normal, delinquent, epileptic, special population 
group, such as army. 

4. Inthe age group tested— pre-school, pri- 
mary, intermediate, advanced, high school,univer- 
sity, general adult. 

5. In the degree of accuracy in testing procedure. 

6. In the validity of the test used. 

7. In the size of the group tested. 

8. In birth status— single born, twin, identical 
twin, mirror-image twin. 

9. In sex. 


The second phase of the present report was car- 
ried out іп a four-state area convenient to the writ- 
er. Answers to the following questions were sought: 

1. What is the extent of the use of the left hand 
in handwriting in the combined grades, one through 
six, at the present time? 

2. What is the extent of the use of the left hand 
in handwriting in each separate grade from one 
through six? 

3. What is the relationship between sex and the 
use of the left hand in handwriting? 

The study was limited to a geographic area in- 
cluding certain schools in Pennsylvania, Ohio, and 
a few schools in New Jersey and New York. The 
schools included subscribe to a handwriting pro- 
gram 3 that has for many years discouraged forc- 
ing children with strong left preference to write with 
the right hand. 

The count was further delimited to left-handed 
writers in grades one throughsix, and thus probab- 
ly should not be compared with studies using other 
scholastic or age groups. 

The data to determine the extent of the use of the 
left hand in handwriting were obtained by means of 
a short questionnaire, and this was given to over 
10, 000 teachers in conjunction with the handwriting 
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program3 used intheir schools. Inaddition to the 
identification of the room, including grade, the 
questionnaire asked for the total number of boys, 
number of boys writing with the left hand, and the 
total number of girls and the number of girls writ- 
ing with the left hand. 

Over 9,000 returns were received, permitting 
representative tabulation and the elimination of any 
questionable reports. Only carefully completed re- 
turns were used and all split-grade (two or more 
grades in a room) questionnaires were eliminated 
from the tabulation. This procedure tended to elim- 
inate errors for the separate grades. 

Reliability of the figures representing the num- 
ber of left-handed writers was checked by combin- 
ing 50 classrooms of pupils in each grade from one 
through six into blocks of 300 rooms. The first of 
these blocks was totaled and the various percents 
calculated for the number of left-handed boys, left- 
handed girls, and the total left-handed pupils. Block 
two was added, and the totals and percents were a- 
gain determined. This procedure was continued un- 
til a stabilized finding was obtained. Tensuch 
blocks proved more than ample. A stabilized find- 
ing was found when block seven was added to the six 
previous blocks, This percent was 11.1. When 
block eight was added, the percent was again 11.1. 
The same percent was obtained when blocks nine and 
and ten were added. This seemed to indicate a uni- 
versal finding in the geographic area covered by the 
Study. 

In obtaining the percent of left-handed pupils (by 
Sex and combined sexes) in each grade, the ten re- 
cording sheets for each weresimply tabulated.This, 
then, would give the percent of left-handed writers 
in 500 rooms in each grade. 


The Findings 


The final tabulation included 3, 000 rooms, 
48, 009 boys, and 44,647 girls, making a total of 
92,656 pupils. Тһе percent of total pupils writing 
with the left hand was stabilized at 11. 1. The per- 
cent of boys using the left hand was 12.5 and of 
girls, 9. 7. These findings, with but slight fraction- 
al variations, also held for each grade, one through 
Six. 


Conclusions 
ааа 


It can, it would seem, besafely statedthat іп the 
geographic area covered by the study, approximate- 
ly 11 percent of all pupils are using the left hand in 
handwriting in grades one throughsix. This num- 
ber is greater than previously гер orted by Selzer 4 
(one to two percent) andPalmer?  (sixpercent). 
The 11 percent finding is also generally higher than 
that usually estimated by educators in the literature? 

Further, previous reports showed a gradual de- 
crease in the percent of left-handed pupils with ad- 
Vancing grades and age." This was not found to 


be true in the present study. It is likely that such 
earlier findings were affected by the gradual retire- 
ment of teachers who insisted on right-handed wzit- 
ing for all pupils. The philosophy that once dictated 
the use of the right hand in handwriting grad ually 
gave way to the newer thinking that permits left- 
handed pupils with strong preference to write with 
the left hand. This, of course, first affected pri- 
mary grades where the use of either hand is usually 
established. During the transitional period, it 
would be normal that more pupils were reported us- 
ing the left hand in beginning grades, and this would 
account for the appearance of a gradual reduction in 
left usage with advancing grades. Atpresent (in the 
study area), practically all teachers in all grades 
permit left-handed writing; therefore the percent for 
each grade is now found to be high and generally 
constant. 

Also, it may be safely concluded that more than 
12 percent of the boys and more than 9 percent of 
the girls write with the left hand ineach grade from 
one to six. The many previous reports of more 
boys than girls using the left hand are Supported by 
this study. There are not, however, (within the 
Scope of this study) approximately tw ice as m any 
boys as girls with left preference as reported for 
handedness in general by Blau.8 

The second part of this study, which will appear 
ina later issue of this journal, will present data on 
“The Relative Efficiency of the Various Approaches 
to Writing with the Left Hand. ” 
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* Тһе Peterson System of Directed Handwriting, 
Greensburg, Pa. 
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What is Television Doing to Our Children? 


MANY EDUCATORS and parents are concerned 
about the cumulative effects of television programs 
on thetotal personality of the American children. 
Before any evaluation can be made now, orinthe 
future, we must have more information with which 
to work. 

In view of the need for further research in this 
field, a study was conducted with the following ob- 
jectives in mind: the specific purpose was to as- 
certain the televiewing habits of junior high school 
Students in the public schools of Amarillo, Texas 
in 1955 and to follow these same students. into high 
School for the same data on their televiewing habits 
in 1958. 

The secondary purposes ofthe study were: (1) to 


determine the average viewing time of the students 
on school days and on weekends; (2) to determine 


the types of programs being viewed by most of the 
children; (3) to determine the types of programs 
that appealed least to the students, and (4) to de- 
termine whether the entirefamily watched programs 
together. 

At the time studies were made, television cov- 
erage for the Amarillo areawasfrom the two 
commercial stations in the city of Amarillo, namely 
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KGNC-TV and KFDA-TV. Amarillo is notwithin 
range of an educational or non-commercial tel - 


evision station, so this material regards commer- 
cial viewing only. 

In setting up this study, it was decided that a 
questionnaire would be used to gather the data. It 
was submitted to the junior high school students 
(seventh, eighth and ninth grades) in1955. The same 
questionnaire was again submitted to the same stu- 
dents when they reached high school (tenth, eleventh 
and twelfth grades) in 1958. Ten questions were 
asked. These were answered by approximately 3500 
students each time it was submitted. Approximately 
2000 of the 3500 students were still attending school 
when the second questionnaire was tended. The 
following results are based on the reactions of the 
2000 students who filled in the questionnaire both 
times (see table on next page). 

Any medium which absorbs as much of a student's 


time as television is almost certain to have some 
effect on his sense of values and on his sensitivity 


to his environment. Itseems that parents andothers 
concerned should help to makethe best possible 
selections and improvement of the material avail- 
able. 
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JR. HIGH SCHOOL HIGH SCHOOL 


Homes with TV sets 92. 66% 99. 13% 
Homes with more than one set 7.05% 28.00% 
Students choosing own program 13. 65% 28. 43% 
Choosing own program with parents help 86. 35% "1. 57% 
Less active now than before television 21. 25% 13. 21% 
Read less than before television 40. 21% 32. 64% 
Played less in groups than before tele- 

vision 21. 45%, 14. 20% 
Studied before watching television at 

night 16. 55% 81. 40% 
Studied after watching television at 

night 41.19% 34. 97% 
Studied less than before television 36. 09% 20. 29% 
Average viewing time per school day 2 hrs. 55 mn. 2 hrs. 25mn. 
Average viewing time per weekend 8 hrs. 49 mn. 5 hrs. 42 mn. 


Average time spent per year watching 
television 1216 hrs. 923 hrs. 


*Maximum time spent per year in class- 
room 1144 hrs. 1144 hrs. 


*Based on requirements of state supported high schools and junior high schools in Texas 
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MENTAL HEALTH AND SCHOOL ADMINISTRATION. .... 

The University Council for Educational 
Administration sponsored three regional seminars on mental health in school ad- 
ministration. The seminars were designed to provide professors of school admin- 
istration and socialscientists opportunitiesto explore their interests in doing basic 
research on problems which have implications for mental health. Officials in the 
National Institute of Mental Health provided information on NIMH grants and their 
potential uses to institutions preparing school administrators. NIMH offers three 
kinds of grants: 

1. Research Grants. These grants are made to fund studies which are tightly 
and rigorously designed and are oriented toward basic research. 

2. Mental Health Project Grants. These grants can be appropriated for a va- 
riety of purposes. With regard to school administration, they can fund 
proposals which would 

a) develop information and knowledge about m ental health functions in 
School districts which can be applied in the administration of schools 
toward betterment of mental health, 

b) implement studies to shed light on mental health functions and prob- 
lems in school districts needed as a basis for developing training 
grant proposals, and 

c) provide limited health and in-service training projects. 

3. Training Branch Grants. Funds are available for subsidizing certain pilot 
projects to test new approaches to the pre-service training of personnel 
who have responsibility for mental health functions. 

These meetings were held in Philadelphia, Chicago, and Denver on October 30, 

and 31, November 1 and 2, and November, 3 and 4, respectively. 


ESTABLISHMENT OF A BUREAU OF EDUCATIONAL RESEARCH AT SAN FRANCISCO 
STATE COLLEGE..... 

The Division of Education of San Fran- 
cisco State College has approvedthe establishment of a Bureau of Educational Re- 
search. Anticipated services are: 

1. Consultation assistance for student research projects 
2. Consultation assistance for faculty research projects 
3. Consultation assistance in the preparation of research proposals 
4. Dissemination of educational research information to division faculty 
5. Dissemination of information regarding educational research activities of 
the college 
W. James Popham will be the Director of this bureau. 


SCHOLARSHIP AWARDS FOR PUBLISHED ARTICLES. .... 
The School Boardfor the Spring Branch 
Independent School District of Houston, Texas, offers scholarships to teachers and 
other professional people in the district who have articles published in professional 
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periodicals. A scholarship of $150 for an article published in a state periodical 
and $250 for an article published in a national periodical may be used for travel or 
School expense, with the only qualification being that the by-line or the content of 
the article give recognition to and identification with the school district. 


FACTOR ANALYSIS OF DIMENSIONS OF ADMINISTRATIVE BEHAVIOR..... 

A three-year study has just been com- 
pleted by Educational Testing Service and Teachers College, Columbia University, 
on in-basket responses of elementary school principals. Two hundred thirty-two 
elementary school principals classified materials found in their in-baskets into 
sixty-eight scoring categories. Eightfirst-order factors were found to account for 
most of the variance. These were designated ав: Exchanging information, discuss- 
ing for acting, complying with suggestions, analyzing a situation, maintaining re- 
lationships, organizing work, responding to outsiders, directingothers. Two sec- 
ond order factors were found. These were designated as: pr e paring for decision 
versus taking final action, and amount of work. 

These dimensions of behavior indicate 
the nature of evaluations which frequently are made by principals. 


JEROME E. LEAVITT—RESEARCH REPORTER FOR THE ААСТЕ..... 
Dr. Jerome F. Leavitt, Professor of 
Education, Division of Education, Portland State College, has been appointed as а 
research reporter for the American Association for Colleges of Teacher Education. 
Acting in this capacity, he will send reports of educational research underway to 
the headquarters of the Association. 


MENTALLY HANDICAPPED CHILDREN. .... 
Dr. А. M. Jordan reports that an inves- 


tigation of the mental, social, and personal characteristics of mentally handicapped 
children has been underway for the past two years under the leadershipof Mrs. 
Thelma G. Thurstone inthe School of Education of the University of North Carolina. 


EDUCATION AND JUVENILE DELINQUENCY IN EUROPE..... 
Dr. William C. Kvaraceus of Boston 


University is conducting a year-long study of the role of school and out-of-school 
education for the prevention and treatment of juvenile delinquency in various Euro- 
pean countries. Supported by UNESCO funds, he will organize an international 
meeting of institutes dealing with adolescent problems. This meeting will be held 
in Europe in March 1962. 


RESEARCH AT WESTERN MICHIGAN UNIVERSITY. .... 
Dr. George G. Mallinson, Dean of the 


School of Graduate Studies, reports that four studies are underway: 
1. An analysis of the factors related to the motivation and achievement of stu- 
dents in science courses in the junior and senior high school. 
2. The use of student reactions to teachers for the pur pose of improving in- 
struction in public elementary and secondary schools. 
3. Educational problems of migrant children. 
4. Creative thinking in children at the junior high-school level. 


Readers are invited to send items of research int erest to 
John Schmid, Professor, College of Education, University 
of Arkansas, Fayetteville, Arkansas. 
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It Says in the JER... 


IF YOU USE A FORMULA for figuring scores on multiple-choice tests, you're prob- 
ably in error, Little says. Correction formulae are frequently used to 
compensate for guessworkon suchtests on the assumption that the reduced 
Score gives a more accurate picture of the student's actual ability. On the 
basis of his research, Little believes this isn't so, and recommends that 
**use of correction formulae be abandoned. ’’ 


* ж * 


Powell, Street, and Hamblen find that STUDENTS 
IN LARGER SCHOOLS TEND TO OUT-PERFORM stu- 
dents in smaller schools. While the study was limited 
to schools of less than 900 enrollment in rural and min- 
ing areas of Kentucky, the writers believe their data may 
apply to many debates about school size. 
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HOW А SCHOOL RUNS ITS SPEECH PROGRAM can be а clue to how well that school 
is educating for democratic living, says Erickson, if you assume that stu- 
dents attending a ‘‘democratic’’ school will later become more responsible 
in their adult citizenship. In thirty Minnesota schools he analyzed the ex- 
tent to which the students themselves believe they influence the management 
of speech activities. Scores ranged from 6.4 to 19.5. In general, Erickson 
finds the following factors conducive to student control in speech activities 
and hence to а non-autocratic environment: 1) extensive student partici- 
pation in speech activities, 2) small school size, 3) high district wealth, and 
4) high school expenditure level. ‘‘When schools exist at a bare minimum 
level, decision-making is not left to students, ’’ Erickson says. 

жжж 
Continued оп page 260..... 
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Overcorrection for Guessing in Multiple-Choice Test Scoring 


THERE ARE SEVERAL procedures for obtaining 
scores оп multiple-choice tests. The number of cor- 
rect responses, the number of correct responses 
minus the number of wrong responses, the number 
of correct responses minus one-half the wrong re- 
sponses, the number of correct responses minus 
one-quarter the wrong responses, are a few of the 
methods used in determining scores. Generally the 
number of correct responses is reduced by an amount 
equal to the number of wrong responses divided by 
one less than the number of options per test item. 


Theformulais S-R -7n- , Where R is the number of 
correct responses, W is the number of wrong re- 


sponses, and N is the number of choices per test 
item. 

Doppelt, in Test Service Bulletin No. 46, January 
1954, indicates that wrong responses are not all 
given in accordance with the laws of chance. Some 
may be given because of misinformation. He also 
points out that the formulas used are based on ques- 
tionable assumptions. Itis assumed that all the wrong 
answers plus some of the right ones are chance an- 
Swers or guesses. Itis further assumed that the re- 
duced score obtained by use of the correction for- 
mula, gives amore accurate picture of the students 
actual ability. This reduced score is supposed to 
indicate the number of answers really known by the 
person taking the test. Doppelt further states that 
corrected scores are an attempt to rule out the ef- 
fects of differential boldness in taking a test, rather 
than a method for getting a true picture of the exam- 
inees’ knowledge. Some student are bold, andanswer 
Questions when they are not sure of the answers, 
While their more timid colleagues would rather omit 
those questions. 

Cronbach and othershave found some évidence of 
“асашевсенсе”” operating in taking tests, since do- 
not-guessinstructions placed ascendant students at 
ап advantage over submissive students. It is argued 
that such instructions reduce the validity of achieve- 
Ment, since they become in some degree measures 
of personality traits. 

Some educators maintain that a student taking a 
true-false examination has a fifty percent chance of 
guessing the correct answer ifhe knows nothing a- 
bout the question. However, L.L. Thurstone in his 


book: The Reliability and Validity of Tests indicates 


that the weighting which is given to errors by the 
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chance formulas is based on the theory of probability 
and for a given examination may be considerably in 
error. 

The actual test situation, however, for agiven ex- 
amination involves a number of factors other than 
chance. The student is not likely to be absolutely 
ignorant concerning the question asked. More likely 
he has various degrees of (1) positive correct in- 
formation, (2) partial information leading to a cor- 
rect answer, (3) total lackof information, selections 
made by pure guess, (4) partial information leading 
to an incorrect answer, (5) positive misinformation 
leading to an incorrect answer, (6) afeeling that the 
test question is stupid, he knows the answer that is 
wanted, but doesnotagree and will not give the cor- 
rect answer. 

Horst has shown mathematically that in the case 
of a multiple-choice item where the correct answer 
is the most popular choice and the choices constitute 
a graded series of steps of knowledge about the point 
being tested, thepercent of the examinees selecting 
the most popular incorrect choice is equal to the 
percent marking the correct answer by pure chance 


The Problem 1. Will a test pattern analysis of the 
wrong answers on a test given to the same group of 
students as a pre-test (before taking course) po st- 
test (immediately after course) and re-test (1 year 
after course completion) give any indication as to 
which wrong answers are due toguessing and which 
are due to other factors ? 

2. What will a test pattern analysis of wrong an- 
Swers on a pre-test, post-test, and re-test 
show as to the average’ of the most popular 
incorrect answer on each test? 

3. МШ а comparison of correction factors as 
obtained by the conventional formula and as 
obtained by using information obtained by the 
above test pattern analysis show that the con- 
ventional formula is overcorrecting? 

The following hypotheses were investigated: 

A. That a test pattern analysis of the same 
test given at three different times to the 
same group of students will show that 
some wrong answers are not marked in 
accordance with the laws of chance but 
instead are basedon positive misinform- 
ation. 
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B. That the basic assumption underlying the 
correction for guessing is misleading 
and may make it appear that the exam- 
inee knows the answers to fewer quest- 
ions than he really does know. 


Method and Procedure In an attempt to investigate 
these questions a test pattern analysis was under- 
taken on a multiple-choice test which was adminis- 
teredthreedifferenttimes. The test was first given 
asapre-test, then as afinal examination four months 
later, and finally as a re-test one year later to the 
same group of students. 

In order to insure a normal population the stud- 
ents' sten scores on the Michigan Vocabulary Test 
меге used as a basis for student selection. The se- 
lection was made in the following way: one student 
each with sten scores of one and ten, one student 
each with sten scores of two and nine, two students 
each with sten scores of three and eight, four students 
each with sten scores of five and six. 

Thetestitems were compiledby a jury of several 
members of the Department of Biological Sciences, 
Chicago Undergraduate Division of the University 
of Illinois. The reliability of the test was calculated 
in terms of the Pearson product-moment method of 
correlation, using the split-half technique. The co- 
efficient was then corrected by the Spearman-Brown 
formula. The reliability coefficient was found to be 
.88. The probable error was not more than plus or 
minus . 034. 

The claim may be made that the test used has 
validity, since there was a high degree of concur- 
rence of three judges in the Department of Biology 
as to what the correct answers ought tobe. The test 
was given to the three judges and each judge made a 
key. The three keys were then compared and only 
items upon which the judges agreed were used as test 
items for this study , and item possibilities not 
marked as the judges agreed that they should be 
marked were counted as wrong answers. A test pat- 
tern analysis of these wrong responses on all three 
tests and of their use in the correction formula was 
then undertaken. The way in which each student dealt 
with each item possibility on each test was carefully 
followed. 'Eachitem possibility was studied according 
to the following pattern: an individual item possibil- 
ity could be wrongonallthreetests (pre-test, post- 
test, re-test),inwhichcaseit was designated WWW. 
It could also be wrong on the pre-test, post-test, 
and blank on the re-test, WWB, blank on the pre- 
test, and wrongonthe post-test, andre-test, BWW, 
wrong on the pre-test, blank on the post-test, and 
Wrong on the re-test, WBW. In addition an individ- 
ual item possibility could be wrong on the pre-test, 
and blank on the post-test and re-test, WBB, blank 
on pre-test and post-test, and wrong on the re-test, 
BBW, and blank on the pre-test, wrongonthe post- 

test, and blank on the re-test, BWB. 

In the same fashion a correct response could be 
correct onall three tests, RRR, right, right, blank, 
RRB, blank, right, right, BRR, right, blank, right, 


RBR, right, blank, blank, RBB, blank, blank, right, 
BBR, blank, right, blank, BRB. Anitem possibility 
which was blank on all three tests was designated 
BBB. 

Results The results relating to hypothesis A are 


shown in Table І. The test pattern analysis indicates 


an average of 1.57 responses marked consistently 
wrong on all three tests. An answer marked con- 
sistently wrong would seem to indicate misinforma- 
tion and we may assume that this figure of 1.57 re- 
presents the average number of wrong answers based 
on positive misinformation. There then follows a 
catagory of three test possibility areas, WWB, BWW, 
and WBW. These areas are close relatives of the 
WWW area since they are predominantly consist- 

ently wrong, they are wrong two times out of three 
and such a percentage would seem to indicate that 
misinformation or partial information leading to false 
conclusions is predominant in the student’s thinking. 

These results indicate that we may accept hypoth- 
esis A and state that a test pattern analysis of the 
same test given at threedifferent times to the same 
group of students shows that some wrong ans wers 
are not marked in accordance with the laws of chance, 
but instead are based on misinformation. 

The following results have been gathered in sup- 
port of hypothesis B; (1) If the four catagories, WWW, 
WWB, BWW, and WBW are considered together as 
representing responses based primarily on misinfor- 
mation, itisfound that by adding WWW-1.57, WWB 
-1.73 and WBW-2.57 a figure of 5.87 is obtained 
which when subtracted from 13.06 average wrong 
answers onthe pre-test, givesafigure of 7.19. This 
figure agrees withthe average number of wrong an- 
swers under the catagory of WBB-7. 19 and is an ар- 
proximation of the average number of wrong ans w ers 
that are wrong because of guessing on the pre-test. 
This figure of 7. 19 when used in the conventional cor- 
rection formula gives acorrection factor of 3.54 as 
Shown in Table П. When this average experimental 
correction factor is subtracted from the average con- 
ventional correction factor of 6. 88 itis found that the 
average difference or average overcorrection is 3. 
34 answers. 

The same is found to be true by adding WWW, 
WWB, and BWW for the post-test correction, and 
WWW, BWW, and WBW for the re-test correction. 
For the post-test a figure of 6. 49 is found and when 
this figureis subtracted from the average total num- 
ber wrong on the post-test, 12. 68, a figure of 6.19 
is obtained. This figure agrees with the average 
number of answers thatare wrong on the post-test 
only. This figure of 6.19 when used in the conven- 
tional correction formula gives an average correc- 
tion factor of 3.01 as shown in Table II. For the 
post-test the averageconventional correction factor 
is 6. 25 and the average difference ог overcorrection 
is 3.24 answers. When the above same calculations 
are carried on for the re-test it isfound that the 
average conventional and experimental correction 
factors are 8. 17 and 4.34 respectively with the av- 
erage difference or overcorrection of the convention- 


al method being 3.83 answers. 

(2) It is further interesting tonote that if each in- 
dividual pre-testscoreis corrected for guessing by 
use of the conventional correction formula, the av- 
erage corrected score is 23.83 or a score which is 
less than the average score for RRR which is 24. 88 
as shown in Tables I and III. Itis assumed here that 
a response correctly given on three different oc- 
casions indicates thatthe student possesses correct 
information about the particular point being tested. 
Further, whenthe effects of forgetting are considered 
it is reasonable to estimate thatthe average original 
pre-test scorefor answers really known by the stu- 
dents at the time of thepre-test would be somewhat 
greater than the 24.88 average number of answers 
known on all three test administrations over a total 
period of one year. This is a further indicationthat 
overcorrection is taking place when the conventional 
formula is used. 

(3) As shown in Table IV the calculation of the 
Significance of the difference between the average 
uncorrected score and the average corrected score, 
where the correction is made by using the conven- 


tional correction formula, Score = R- 1-1; indicates 
that the difference is due to more than chance. The 
average differences are 6.88, 6.25, and 8.17 for 
the pre-test, post-test and re-test respectively. T 
Scores of 8. 6, 11.0, and 13. 6 were obtained for the 
pre-test, post-test, and re-test respectively. 

If all of the wrong answers were due to chance 
alone such evidence should be non-existant, but as 
shown in Table IV it is absolute certainty in each 
case that the difference is due to more than chance. 

(4) Horst has shown mathematically that in the case 
ofa multiple-choice item, where the correct answer 
is the most popular choice and the choices constitute a 
graded series of steps of knowledge aboutthe 
point being tested, the number of the examinees 
Selecting the most popular incorrect choice is 
equal to the number marking the correct answer 
by pure chance. When the pre-test, post-test, and 
re-test were analyzed in this fashion it was found 
that there were 4. 08 average most popular incor- 
rect choices on the pre-test, 4.13 average most 
popular incorrect choices onthe post-test, and 
5.37 average most popular incorrect choices on 
re-test. These averages compare favorably 
With the experimentally calculatedaverages of 
3.54, 3.01, and 4.34 for the pre-test, post-test, 
and re-test respectively which apearin Table 
IL This is still further proof that the conven- 
tional correction formula is overcorr ecting 
for guessing. 

(5) It is interesting to note as shown in Table III 
that the students **Plunge" in thenew blank area on 
each test administration. The greatest number of 
Wrong answers are in areas not formerly attempted. 
For example, onthe post-test the greatest number 
9f wrong answers arenotinthe area of former wrong 
answers on the pre-test but in the area formerly 
blank on the pre-test, also, on the retest, the 
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greatest number of wrong answers аге in the area 
formerly blank on both the pre-test and post-test. 
It is from scores for wrong answers in these areas 
that a more accurate estimate of a student's ‘‘bold- 
ness” or his willingness to take a chance may be 
calculated. This is additional evidence that the score 
representing the total number wrong answers 

Should not be used in the conventional correction 
formula to determine the number of correct answers 
that are correct due to chance alone. 

We may now accept hypothesis B and state that 
the basic assumption underlying the correction for 
guessing is misleading and may make it appear that 
the examinee knows the answerstofewer questions 
than he really does know. 


Discussion The results of this study have shown 
that there àre other than chance factors involved in 
Student selection of multiple-choice item po s s ibil- 
ities on multiple-choice tests. Student information 
regarding an item possibility follows a continuum 
from positive correct information all the way through 
the various degrees of fringe information and no in- 
formation at all, to positive misinformation. Asa 
result, when the total number of wrong answers on 
a multiple-choice test are used in the correction 
formula they yield a correction factor which over- 
corrects, andgives the studenta lower estimate than 
is warranted of his true knowledge and ability. 

Strictly speaking, as indicated by Gulliksen, the 
correction formula is needed only when a student has 
omitted а fairly large ог a fairly small number of 
item possibilities. Otherwise, as shown by Stanley, 
the rank order of the students will be unchanged, 
regardless of whether or not their scores are cor- 
rected for guessing. 

The fundamental purpose in giving a test is to 
obtain samples of behavior which will permit com- 
parisons with respect to some reasonably well de- 
fined knowledge, skill, or attribute among individ- 
uals inthe group tested. Itis, or should be, а method 
for getting a true picture of the examinee’s knowl- 
edge. 

w is the author’s opinion, based on the results 
of this research that in order to get a true picture 
of the examinee’s knowledge, the schools should 
discontinue all use of the correction formula. In its 
placethe author recommends inspectionof multiple- 
choice answer sheets and hand-grading of those an- 
swer sheets that show unusual patterns of either too 
few or too many omissions. These are easy to loc- 
ate, andare usually few in number. In this way pro- 
per handling of individual cases can be accomplished 
without penalty to the whole group. 

If the correction for guessing formula is to be 
used, however, byteachers who feel that they must 
use it, then only numbers which are known to ap- 
proximate the true score for guesses should be used 
in score calculations. Since scores for guesses are 
difficult to determine for a single test administra- 
tion, some simple modification of the present system 
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Should be agreed upon and standardized. is par- 
ticular experiment it was found that R-3 3-1 gavea 
correction factor which closely approximated the 
actual number of correct answers that were correct 
duetoguessing. This modified correction factor was 
also in close agreement with the correction factor 
obtained by the most popular incorrect answer meth- 
od as used by Horst. 


Conclusion Hypothesis A states that a test pattern 
analysis of the same test given at three different 
times to the same group of students will show that 
Some wrong answers are not marked in accordance 
with thelaws of chance but instead are based on 
positive misinformation. The scores obtained for 
hypothesis A in this experiment indicate that some 
wrong answers are based on misinformation. We may, 
therefore, accept hypothesis A. 

Hypothesis B states that the basic assum ption 
underlying the correction for guessing is misleading 
and may make it appear that the examinee knows 
the answers to fewer questions than he really does 
know. 

The following scores were obtained in this ex- 
periment in support of hypothesis B: 

(1) The scores for wrong answers approximating 
pure guesses, 7.19, 6.19, and 9.80 onthe pre-test, 
post-test, and re-test respectively (scores obtained 
by subtracting the number of wrong answers due to 
misinformation from the total number of wrong an- 
Swers) when used inthe conventional correction for- 
mula give correction factors of 3.54, 3.01, and 4.34 
on the pre-test, post-test and re-test respectively. 
These correction factors approximate the number of 
right answers which are pure guesses. They agree 
favorably with a test item analysis by the Horst 
method in which the average most popular wrong an- 
Swer scores were found to be 4. 08, 4.13, and 5.37 
on the pre-test, post-test and re-test respectively. 

(2) Theaverage number of answers correct on all 
three test administrations (RRR-24.88) exceeds the 
average number of pre-test answers correct as ob- 
tained by use of the conventional correction formula 
(23.83). This isa further indication that scores are 
being overcorrected when the conventional formula 
is used. 

(3) In this experiment the calculation of the sig- 
nificance of the difference between an uncorrected 
Score and a score corrected by use of the conven- 
tional formula (Score = R -74; indicates that the 
difference is due to more than chance. 

(4) An analysis of the three tests to determine 
the most popular incorrect answer average on each 
test gave figures of 4.08, 4.13, and 5. 37 respect- 
ively on the pre-test, post-test, and re-test. Each 


of these figures is less than the figure obtained by 
use of the conventional correction formula. 

(5) The greater number of incorrect answers on 
the post-test, and re-test are in blank areas never 
before attempted by the student. This figure is less 
than the total number of incorrectanswers, but still 
is not a precise estimate of the wrong answers 
which are due to guessing alone. 

АП of the above scores indicate that the total 
number of wrong responses should not be used in 
the conventional correction formula for the calcula- 
tion of the number of correct answers that aredue 
to chance alone. We may therefore accept hypoth - 
esis B. 

For this experimental group the theoretical cor- 
rection for chance reduced the total scores. The 
assumption that this reduced score gives amore ac- 
curate picture of the student's actual ability was 
found to be incorrect. 

The author would prefer to recommend, there- 
fore,that use of the correction formula be aban- 
doned, but for those teachers who find this impos- 
Sible, the author suggests a modification of the cor- 
rection formula. In this experiment, the formula, 
Score = R -4 mets was found to yield a more accur- 
ate total score. 
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TABLE I 


TABLE SHOWING TEST PATTERN ANALYSIS. THE FIGURES SHOWN ARE 
THE AVERAGE NUMBER OF ITEMS ON THE THREE TESTS (PRE-TEST, 


POST-TEST, AND RE-TEST) FOR EACH OF THE INDIC ATED CATAGORIES 


RRR 24. 88 BBB 
RRB 3.23 RBB 
BRR 17.95 BBR 
RBR 1.57 BRB 
WBB 
BBW 
BWB 


118. 53 
1.03 
4.38 

10.19 
7.19 
9.80 
6.19 


WWW 
WWB 


BWW 


1.57 
1.73 
3.19 
2.57 
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TABLE II 


SCORE CORRECTION FACTORS AS CALCULATED BY THE CONVENTIONAL CORREC TION 
FORMULA, AND AS CALCULATED BY THE EXPERIMENTAL 
METHOD, SHOWING THE DIFFERENCE OR 
OVERCORREC TION 


Correction Factor 


Answers Wrong Correction Factor Difference 
Total Number On This One As Calculated by as Calculated by ог over- 
est Wrong Answers Test Only Conventional Formula Experimental Method correction 
Pre-test 13.06 7.19 6.88 3.54 3.34 
Post-test 12. 68 6.19 6.25 3.01 3.24 
Re-test 17.13 9.80 8.17 4.34 3.83 
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Pre- 
test 


Post- 
test 


Re- 
test 


30.71 


R B 
24.88 3.23 


2.60 
B B 
1:57-— 21-03 


TABLE Ш 


GENETIC TYPE ANALYSIS CHART 


B 
170.23 
R B 
28.14 132.71 
R B R B Ww 


17.95 10.19 4.38 118.53 9.80 


w 
9.38 
w B 
3.19 6.19 


13.06 


/\ 


3.30 


Ww B w 
1.57 1.73 2.57 


B 
9.76 


B 
7.19 
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TABLE IV 


TABLE SHOWING SIGNIFICANCE OF DIFFERENCE BETWEEN THE AV- 
ERAGE UNCORRECTED SCORE AND THE AVERAGE SCORE CORREC TED 
BY USE OF CONVENTIONAL CORREC TION FORMULA. 


Uncorrected Corrected Score Certainly that 

Score for cor- for Correct difference is due 
Test rect Answers Answers Difference T to more than chance 
Pre-est 30.71 23.83 6.88 8.6 Absolute 
Post-test 56.25 50. 00 6.25 11.0 Absolute 
Re-test 48.78 40.61 8.17 13.6 Absolute 
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A Factor-Analytic Study of Critical Thinking 


IN THE LAST TWO or three decades it would ap- 
pear that educators have become concerned with 
what the school can do about teaching critical think- 
ing skills. Moreover, if the school is going to give 
training in these skills, then teachers and adminis- 
trators are involved both in problems of content and 
methodology and also in problems of measuring pro- 
gress toward these goals. 


The Illinois Critical Thinking Project 


Perhaps one of the first large scale attempts to 
teach ‘‘critical thinking" was the Critical Thinking 
Project (Smith, Henderson, Swift, Ennis, Aschner, 
and Payette, in press) of the Illinois Curriculum 
Program Committee, which was launched in Septem- 
ber, 1954. Тһе teachers and project staff members 
responsible for the planning of this project conceiv- 
ed of critical thinking as the process of evaluating 
arguments and assessing the way in which conclu- 
Sions are reached. Lists of critical thinking abili- 
ties and items of knowledge were drawn up, and in 
the summer of 1954 a student handbook entitled 
Guide to Clear Thinking (Henderson, Smith, Swift, 
and Aschner, 154) was prepared. Nearly three 
thousand students in five Illinois high schools were 
involved in this project. Approximately one-third 
of these students served as a control group. Тһе 
remaining students weregiveninstruction and prac- 
tice in processes of critical thinking in connection 
with the conventional subjects: English, social stud- 
les, science, and mathematics. 

In September, 1955, differences between the ex- 
perimental and control groups were measured. Two 
Standardized tests of criticalthinking were selected 
for this purpose, the Watson-Glaser Critical Think- 
ing Appraisal, Form Bm, and А Test of Critical 
Thinking, Form б, prepared by the American Coun- 
cil on Education (hereafter called the ACE test). As 
another evaluation technique the staff used A Test 
Оп Principles of Critical Thinking, Form ЕТ 5, 
Which was especially designed to dis cover whether 
the students knew principles of critical thinking. 


— 2 E 220020020000 


VELMA |. RUST, R. STEWART JONES, апа 
HENRY F. KAISER, University of Illinois 


Various achievement tests were also administered. 
On the whole, the findings were not conclusive. 
Only some differences bet ween experimental and 
control groups were Statistically significant. One 
finding of particular interest was the low intercor- 
relations obtained among the tests of critical think- 
ing. 


Factor Analyses of Three Tests of Critical Thinking 


From the evidence available, the Illinois Curric- 
ulum Program Committee was forced to admit that 
knowledge of principles of critical thinking was not 
so closely related to the ability to evaluate argu- 
ments and assess conclusions drawn as it had anti- 
cipated. Rust (12), therefore, made a study of the 
factor-analytic content of the three measur- 
ing instruments used, in the hope of finding 
whether the three tests do, infact, measure what 
they purport to measure and whether they do mea- 
sure the same skills, abilities, and knowledge. In 
her analysis, Rust used a sample of all 587 students 
from the original group involved іп ће Illinois Crit- 
ical Thinking Project who had taken all three criti- 
cal thinking tests and had attempted all items on 
these tests. This sampling technique was used be- 
cause a factor analysis can much more readily be 
carried out using an electric computer if there is an 
answer for each test item. Moreover, aside inves- 
tigation showed that omitting sample members who 
had not attempted all items had not created bias. 

Factor analyses of the items ofeach of the three 
tests— the Watson-Glaser, the ACE, and the Prin- 
ciples test—were carried out. A single factor an- 
alysis of the items of all three tests at once, which 
would seem desirable, was not possible because of 
the capacity of the electronic computor available. 
Phi correlation coefficients were computed and the 
centroid method of factoring was employed. Thein- 
tention had been to attemptto rotate to oblique sim- 
ple structure. The finding, however, of only one 
weak general factor for each of the three tests made 
carrying the factor analysis procedure further seem 


* The computations for this paper were done on Шіас, an electronic computer of the Digital Computer Labora- 


tory of the University of Illinois. 


254 THE JOURNAL OF EDUCATIONAL RESEARCH 


pointless. These general factors accounted for on- 
ly seven, fifteen, and eight percentof the total var- 
lance for the Watson-Glaser, ACE, and Principles 
tests respectively. 

Other investigators, on the contrary, have re- 
ported that factor analyses of intelligence tests, 
performance tests, reasoning tests, tests of psy- 
chomotor and physical abilities, and tests of occu- 
pational abilities have all yielded a thoroughly es- 
tablished general factor (within a given test). A few 
of these findings are summarized below. 


Other Findings 


Several investigators have factor analyzed rea- 
soning and/or intelligence tests, using procedures 
similar to Rust's. Corter (1952) carried out a fac- 
tor analysis of a battery of tests including reason- 
ing and concept formation tests. He co mputed 
Pearson product- moment correlation coefficients 
and used Thurstone's group centroid method. Seven 
factors were isolated. Guilford and others (1954) 
completed a factor-analytic study of Navy reasoning 
tests with the Air Force Aircrew Classification 
Battery. They used tetrachoric correlations, prin- 
cipal axes factoring, and an orthogonal rotation of 
axes оп the basis of Thurstone’s criteria of positive 
manifold and simple structure. Sixteen factors 
were obtained. 

Botzum (1951) reports that in a study involving 
46 tests, 22 of which were reasoning tests, eight 
first order factors and four second order factors 
were extracted. Green and others (1953) carried 
out a factor-analytic study of reasoning abilities for 
the purpose of exploring abilities considered to be 
important in the successof high-level personnel. 
They isolated twelve interpretable factors. Anan- 
alysis of reasoning tests by Matin and Adkins (1954) 
yielded six second order factors. When he factor 
analyzed the Wechsler Adult Intelligence Scale, Со- 
hen (1957) extracted five factors and one strong 
second order factor. 

Three characteristics are common to most such 
studies. a) Test or subtest scores— not individual 
item scores— have been factor analyzed. b) Al- 
though there is someoverlapping among the factors, 
at least qualitatively the findings seem to show that 
the factors are of two kinds: reasoning and non- 
reasoning. Reasoning factors differfrom others in 

the cognitive area in that they involve the ability to 
reorganize information. Factors thought to Бе rea- 
soning factors were often given labels such as : in- 
duction, deduction, general reasoning, logical rea- 
soning, or eduction of perceptual relations or con- 
ceptual relations, etc. Among the non-reasoning 
actors were frequently found: verbal comprehen- 
sion, numerical facility, perceptual speed, memory, 
risualization, word fluency, etc. с) Loadings оп 
'easoning factors are usually lower than those on 
ther factors, often below 0. 50. 


Rationale for Factor Analyzing Subtests in Prefer- 
ence to Items 


If test makers in their a priori reasoning regard- 
ing grouping of items are correct, then a score on 
a subtest provides more reliable evidence of a sub- 
ject's ability to do a certain kind of reasoning than 
does a score on anindividualtest item. Under rea- 
sonable assumptions, thereliability of a variable 
increases when the number of items in the variable 
increases. This is the ‘‘Spearman-Brown”’ effect 
which Gulliksen (1950) has discussed. When this is 
true, then intercorrelations among subtest scores 
Should be higher than intercorrelations among indi- 
vidual item scores. This implies that factor analy- 
sis of subtest scores should yield stronger factors. 
Intercorrelations among s ubtest scores could also 
be expected to be higher than interitem phi coeffi- 
cients, because of phi’s typically being bounded at 
values (absolutely) less than one. 


Procedure 


In this study, therefore, the authors’ major ob- 
jective was to explore the domain of critical think- 
ing. This they first attempted to do by dividing the 
three critical thinking tests into 21 subtests and fac- 
tor analyzing the subtest scores of the same 587 
subjects used in Rust’s study. It was not difficult 
to divide the 99 items of the Watson-Glaser test in- 
to five subtests since the test-makers had obvious- 
ly grouped their items as follows: Inference (20 
items), Recognition of Assumptions (16 items), De- 
duction (25 items), Interpretation (24 items), and 
Evaluation of Arguments (14 items). In the case of 
the ACE test, the grouping of the 52 items was 
based mainly upon the location of breaks (in the 
form of four asterisks) between groups of similar 
items or items relating to one situation. 

This procedure yielded nine subtests consisting 
of from three to ten items each. Short meaningful 
labels were devised for the subtests for ease of ref- 
erence. Where these differ considerably from the 
wording of the testmakers as found in the manual, 
their wording is givenin parentheses. Theselabels 
were: Direction of Evidence (Ability to Select Per- 
tinent Information, 6 items), Valid Inferences1 
(Ability to Make Valid Inferences and to Judge the 
Validity of Inferences, 4 items), Valid Inferences 2 
(Ability to Make Valid Inferences and to Judge the 
Validity of Inferences, 6 items), Relevant General- 
izations (Ability to Define Problems, 5 items), Re- 
cognition of Assumptions (Ability to Recognize Un- 
stated Assumptions, 10 items), Valid Inferences 3 
(Ability to Recognize Unstated Assumptions, 3 
items), Valid Inferences 4 (Ability to Recognize Un- 
stated Assumptions, 3 items), Hypothesis For ma- 
tion 1 (Ability to Invent and Evaluate Hypotheses, 9 
items), and Hypothesis Formation 2 (Ability to In- 
vent and Evaluate Hypotheses, 6 items). 
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It will be noted that the testmakers list only five 
different abilities. It would be interesting to re- 
peat the procedure outlined here, usingtheir group- 
ing. There is another slight difference between the 
grouping used and that of the testmakers—they in- 
clude item 11 under Direction of Evidence (Ability 
to Select Pertinent Information) instead of under 
Valid Inferences 2. The 53 items of the Principles 
test were divided into seven subtests following nat- 
ural breaks in the test, that is, where the instruc- 
tions to the subject changed. Inferences regarding 
the purpose of a group of items were madefrom 
these instructions and short, seemingly appropriate 
labels devised. This procedure yielded the follow- 
ing subtest titles: Recognition of Definitions (6 i- 
tems), Semantics 1 (7 items), Material Fallacies 
(1 items), Semantics 2 (7 items), Rules of Reason- 
ing (13 items), Identification of C riterion and As- 
sumption (3 times), andValidInferences (10 items). 

Pearson product-moment correlation coeffi- 
cients were computed for the 21 subtest scores. 
The principal axes method of factoring the correla- 
tion matrix was used. As communality approxima- 
tions, unities were placed in the diagonal cells of 
the matrix. When Guttman's (1954) lower bound for 
the number of common factors was applied, the in- 
vestigators were certain they had atleast three fac- 
tors, because the correlation matrix had three lat- 
ent roots (eigenvalues) greater than one: 6.67, 1.20, 
and 1.10. Therefore, the first three principal axes 
were rotated according to the varimax criterion 
(Kaiser, 1958). 


Findings 


The first three principal axes accounted for 
31.76, 5.71, and 5. 23 percent ofthe variance of the 
Subtest scores respectively, while after rotation, 
this variance was redistributed to each of the vari- 
max factors as 24.47, 12.90, and 5.33 percent of 
the total variance of the subtests. The rotated var- 
imax factors are shown in Table I. 

Twelve of the 21 subtests have loadings of 0. 50 
ог higher on Factor A. Five of the loadings are 0- 
ver 0. 60 and one is as high as 0.76. Allof these 
Subtests are in the two tests of critical thinking a- 
bilities and skills—the Watson-Glaser and the ACE 
tests. In fact, if loadings between 0.30 and 0.50 
are included, then it can be saidthat all 14 subtests 
of these two tests have loadings on Factor A. The 
Subtests in the Principles test are not substantially 
represented here, that is, noloadings over 0.50 
occurred. The largest loading of a subtest of the 
Principles test is 0. 43. 

Three subtests of the Principles test have load- 
ings from 0.53 to 0.74 on Factor B. Three other 
Subtests of this test have loadings between 0.30 and 
с; 50 оп this factor, as do four subtests of the ACE 
est. 

Factor C was made up almost entirely of subtest 
20 in the Principles test which involves identifica- 


tion of a criterion and assumptions. It is the only 
Subtest with a loading over 0.30 on Factor C, and 
its loading is high, 0.82. 


Interpretation 


Since nearly all subtests of the Watson-Glaser 
and ACE tests have high loadings on Factor A, and 
since so many different skills and abilities are sup- 
posedly being measured by the items of these sub- 
tests—deduction, drawing valid inferences, recog- 
nizing assumptions and relevant generalizations, 
and formulating hypotheses, etc. —this factor may 
be considered a General Reasoning Factor. 

Factor B is less easily identified on a logical ba- 
sis. Subtests 15, 16, and 17, inthe Principles test, 
which have the highest loadings on this factor (over 
0. 50) are labeled Recognition of Definitions, Seman- 
tics 1, and Material Fallacies. Actually, Factor B 
involves such skills as recognizing whether or nota 
definition is complete, how thetruth of an assertion 
can be decided, and what kind of a fallacy is illus- 
trated. Subtest 19 with a loading of 0. 47 requires 
the subject to decide whether or not the reasoning 
is sound. In order to show the reader more clear- 
ly the kind of item content and format involved here, 
a sample item from each ofthe two subtests having 
the highest loadings is quoted: 

““Тһе apple is red, large, and juicy. Therefore 

it is a good apple. "— which is to be markedas 

follows: 

** A-- the truth of the assertion can be decided 
with facts, either those given or additional 
facts. 

B-- the truth of the assertion is difficult to de- 
cide because criteria (standards) for de- 
ciding its truth are notgenerally agreed 
upon. 

C-- the truth of the assertion can be decided 
without facts. ’’ (from subtest 16) 

*t Acids turn blue litmus paper red. All three of 

these liquids turned the litmus paper red, and 

hence they are асійв.”” —which is to be marked 
as follows; 

**A-- itis a hasty generalization. 

B-- it begs the question. 

C-- it commits the fallacy of composition. 

D-- it commits the fallacy of argum entum ad 
hominem. 

E-- none of the above. ” (from subtest 17) 

Since these items, and many of those in other 
subtests having loadings between 0. 30 and 0. 50, all 
require the subject to make logical judgments or 
discriminations—to react critically to various 
statements—it is thought that Factor B can be call- 
ed Logical Discrimination or Application of Logical 
Principles. 

Subtest 20, in the Principles test, the only sub- 
test with a high loading on Factor C, consists of 
only three items, in one of which the subject is re- 
quired to know what a criterionis, and in the two 
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TABLE I 


VARIMAX FAC TOR LOADINGS FOR SUBTESTS OF THREE TESTS OF CRITIC AL THINKING 
SS MM——M—M———————————————— 


Test and Subtest Items No. of Items = тыры 

Watson-Glaser E 99 

1. Inference 1-20 20 . 69 1 -.07 

2. Recognition of assumptions 21-36 16 .67 -.03 11 

3. Deduction 37-61 25 .76 .27 .09 

4. Interpretation 62-85 24 .57 JT 215 

5. Evaluation of arguments 86-99 14 .45 .18 -.26 
ACE 52 

6. Direction of evidence 1- 6 6 ‚51 .25 -.03 

7. Valid inferences 1 7-10 4 - 54 .21 -.04 

8. Valid inferences 2 11-16 6 .63 .24 .02 

9. Relevant generalizations 17-21 5 .55 .22 -.04 

10. Recognition of assumptions қ 22-31 10 . 61 .32 -.08 

11. Valid inferences 3 32-34 3 .58 .18 -.04 

12. Valid inferences 4 35-37 3 .50 .43 .03 

13. Hypothesis formation 1 38-46 9 .54 .32 -.20 

14. Hypothesis formation 2 47-52 6 .32 .44 -.18 
Principles 53 

15. Recognition of definitions 1- 6 6 .25 $53 .06 

16. Semantics 1 7-13 T .16 .61 -.30 

17. Material fallacies 14-20 7 .01 .74 .12 

18. Semantics 2 21-27 7 .32 ES .29 

19. Rules of reasoning 28-40 13 .33 247 .16 

20. Identification of criterion and assumptions 41-43 3 .02 . 09 .82 

21. Valid inferences 44-53 10 .43 .39 :14 

———————MÀ———— 
Sum of squares 5.14 2.71 1.12 
Percentage of total variance 24.47 12.90 5.33 


І----------------- aee Se ee ee 2 
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TABLE П 


MEAN ITEM DIFFICULTIES OF SUBTESTS OF THREE TESTS OF CRITICAL THINKING 


А АААИИАИАИААЫН——_— 


=_=—_—_—_——Є—Є——Є—Є—Є—Є—ү—Є————-——————— 


Test and Subtest Number of Items Mean Item Difficulty 


———M——— ÀÁ————— — ——————————————————————————— 


Watson-Glaser 


* 
1 20 .62 
2 16 . 68 
3 25 .13 
4 24 ‚65 
5 14 .69 
АСЕ 
6 6 85 
7 4 .46 
8 6 61 
9 5 60 
10 10 " 73 
11 3 69 
12 3 57 
13 9 76 
14 6 62 
Principles 
15 6 ‚61 
16 7 . 60 
17 1 .44 
18 1 .49 
19 13 .48 
20 4 3 .28 
21 10 .46 
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others to know what an assumption is. Subtest 16, 
which also tests the subject's knowledge of seman- 
tics, has a loading of -0. 30 on Factor C. Herea 
Subject must tell how the truth of an assertion can 
be decided. Item 10, which is a sample question of 
this type, was quoted previously. Factor C is, 
therefore, called Semantics. Itshould be noted that 
Factor C is a near-specific and may be interpreted 
only most tentatively. 


Further Considerations 


The investigators wanted to be confident that the 
factor pattern was truly the result of differences in 
the kinds of abilities, skills, and kno wledges re- 
quired to answer the items correctly and not merely 
the result of differences in difficulty or abstractness, 
or format of the questions. They also wondered 
whether the fact that some items involved opinions 
affected the result. 

In order to ascertain whether or not differences 
in difficulty of items might explain the factor pat- 
tern obtained, mean item difficulties for the sub- 
tests were computed and are shown in Table II. 

No relationship can be seen between factor load- 
ings and mean item difficulties, except in the case 
of subtest 20 which has a loading of 0.82 on Factor 
C and a mean item difficulty of 0.28, which is con- 
siderably lower than mean item difficulties of the 
other subtests. Itis, therefore, possiblethat Fac- 
tor C, instead of measuring the subject’s knowledge 
of semantics, merely reflects therelative difficulty 
of the items of subtest 20. 

The effect of format was also investigated. The 
subtests contain a variety of kinds of items, all of 
the multiple choice type. The subject has two, 
three, four, or five choices from which to select 
the correct response. Some items are preceded by 
а set of alternative responses such as true, prob- 
ably true, probably false, false, etc., and instruc- 
tions regarding when to use each response, plus a 
description of a situation. Others were preceded by 
brief general instructions anda series of alternative 
responses in sentence form. In some subtests, the 
description of the situation precedes the list of al- 
ternative answers. In still others the items form a 
Sequence, that is, an answer depends upon thean- 
Swer to a previous item. Althoughanalysis reveal- 
ed that some differences in format among subtests 
having high loadings on the three factors are dis- 
cernible, yet these are not clearcut. Thus, class- 
ification of subtests according to format does not 
Seem related to the factor pattern of the subtests. 

In addition to investigating the effects of item 
difficulty and format, an attempt was made to dis- 

cover whether the items of some subtests were so 
worded that the opinions and beliefs of the subjects 
would affect their responses. A search was made 
for items containing undefined value terms , emo- 
tionally toned words, and for those requiring re- 
Sponses such as probably true, probably false, weak 


argument, or strong argument. Some interesting 
trends in responses to some isolated items such as 
items 15 and 20, two of the most difficult items in 
the Watson-Glaser test, were noted. Since item 15 
refers to a Negro doctor anditem 20 to the midnight 
closing law for dance halls, it is quite possible that 
responses were influenced by subjects' opinions. 
However, no relationships between subtests contain- 
ing items involving opinions and beliefs of the sub- 
jects and the factor pattern were readily discern- 
ible. The factor pattern obtained by factor analysis 
did not segregate these subtests from others. 
There is no evidence, therefore, that the factor 
pattern has been influenced by any of the following: 
the difficulty of items (with one exception), the for- 
mat of the questions, or the involvement of a sub- 
ject's beliefs or opinions. 


Summary 


Although Rust's earlier attempt to factor analyze 
the three critical thinking tests using item scores 
was fruitless, when the items were grouped accord- 
ing to the testmakers' logical grouping, factor an- 
alysis yielded three factors of sufficient strengthto 
warrant the conclusion that grouping items affects 
the apparent factor content of a test. (Inafuture 
investigation, it might be profitable systematically 
to attempt to discover the effect on the factor pat- 
tern of grouping the items in more than one way.) 

Two of the three factors obtained arefairly 
Strong; one seems quite appropriately to bear the 
label of General Reasoning, and the other, Logical 
Discrimination or Application of Logical Principles. 
The third factor, however, is less clear. It may 
represent sophistication in the use of language and 
for this reason has been tentatively called Seman- 
tics or Verbal Understanding. On the other hand, 
it may only be measuring the relative difficulty of 
the items. 
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А SOURCE OF USEFUL IDEAS FOR PROMOTING ATTENTION TO RESEARCH IN TEACHER ED- 
UCATION appears in a recent bulletin (No. 11, December 20, 1961) of the American 
Association of Colleges for Teacher Education. Single copies of this bulletin are avail- 
able from Dr. Richard E. Lawrence, Associate Secretary for Research and Studies, 
American Association of Colleges for Teacher Education, 1201 Sixteenth Street, NW., 

# 


Washington 6, D. C. ^ 
i i he 

4 ber 1961 Newsletter of the Inter- University Committee ont | 
DR:“RALEH асс ors of Honors programs should let him know if 


rior Student, requests that Direct: 3 5 
pui cis their campuses would like to cooperate іп studies of such programs. 
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It Says in JER continued.... 
a 


If a young teacher gets simple, prompt feedback from 
her pupils on how well she's doing, will her teaching 
techniques improve faster than if there is no such feed- 
back? You might think so, but IT DID NOT WORK OUT N ы 
THAT WAY іп arecent Californiastudy. McNeil, there- ' 
fore, believes that 1) feedback messages must be de- 
tailed and sophisticated, and 2) the teacher must know how 
* to change her procedures if she is in fact to meet stu- 


dent needs. 
ж”... 


* 

AMERICAN ADVANCED DEGREE CANDIDATES show a notabledack of confidence 
"іп their own intellectual prowess, if a California study is diagnostic. Potter 
asked some 400 post-baccalaureate students to perform abitofintrospection. ~ 

‚ They gave themselves high ratings on moral attitude, health, and coopera- 


_ боп, but they seldom found among their traits a creative imagination, a fa- 
cile command of facts, or the power of concentrated effort. This lack of ы 
` security suggests the need for counseling, Potter says. * 


) жжж 


“St. Wapniacl," **P.C.S. Mazintl," “Thirty days * p. 
hath September, ’’ “Оп old Olympus’ towering top! —DO ; 
THESE AND SIMILAR MNEMONIC DEVICES AID MEM- 

ORY? In a study of college students taking"a course in 

educational psychology, Adams found that a prescribed 

mnemonic device did not seem to help either rote or mean- 

ingful learning. On the.other hand, the device didn't do 

any harm. He reviews the literature of mnemonics and 

suggests the technique may about to be revived by those 

seeking unconventional approaches to problem solving. 


* si жж ж * 
* 

HOW ITEMS ARE GROUPED may affect the apparent factor content of a test, Rust, 
Jones, and Kaiser postulate. Although Rust's earlier attempt. to factor- 
analyze three critical-thinking tests using item scores was fruitless, when 
the items were grouped according to the testmakers' logical groupings 
factor-analysis yielded three factors of significant strength. i 
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A PERENNIAL DEBATE has been waged in А- 
merican education for at least half a century over 
the one-room versus the larger city and consolidat - 
ed school. Sophisticated educators, it is true, have 
had comparatively little ar gum ent with each other 
over this issue and have been aligned generally with 
the consolidated school. The public, however, 
cherishes the stereotype of the *«little red school 
house, '' and a persistent and vocal minority harbor 
a continuous faith in its ultimate superiority over 
all others. 

This study is nota comprehensive examination 

_ of the issue, but an attempt to provide new informa- 

* tion on the subject. Those using this report should 

perhaps be alerted to note that the c o mparisons 

made and reported here are between schools as they 

are, not as they might be. On the whole, the little 

schools, as observed by those reporting here, were 

` , «comparatively barren of facilities and staffed by 

teachers with less training than those іп the larger 
Schools in the same school district. 

This does not mean that they were inexpensive 
schools. In fact, those of very s тай enrollments 
had per capita pupil costs well above average for the 
district. Since such aspects are not under consid- 
eration here, this statement is sim ply to suggest 

7 that no one should generalize from this study that 
size of school has a simple; direct relationship to 
teaching efficiency. It is to suggest, however, that 
size of school may, through its relationship to a 
chain of other relationships, affect learning. "t 
Seems reasonable to suggest that large® schools 

* tend to have more facilities than small schools, 
tend to attract better-prepared teachers, and tend 
to be located in areas where there are more inci- 
dental educational opportunities for students--that 
а “сопврігасу”” of interrelated causes actually pro- 
duces the relationship of size of school to achieve- 
ment of students which this study makes evident. 
On the other hand, it is not denied that size itself 
may have a direct relationship all its own. 


Purpose of the Study : ж 


< Specifically, this study is an attempt to examine 

the possible relationship between performance of 

students on standardized achievement tests and the 
d 


. " Achievement of Students and Size of School 


Е - 


PAUL STREET, JAMES H. POWELL 
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size of the schools (in terms of enr o llment) which 
they attend іп a mining and а rural area of eastern 
Kentucky. Each area was treated separately and 
then bined. District À is a county which pro- 
duced about^one-eighth of the coal mined in the state 
in 1956. District В, а county in ahilly agricultural 
area, was reported in the 1950 U. S. Census as 
without an **urban" population, with 29.4% ‘‘rural- 
nonfarm” and 70. 6% ‘‘rural-farm’’ population. It 
seems reasonable to assume that each district re- 
spectively is typical of miningand rural eastern 
Kentucky districts generally, that the se venth and 
eighth grades are satisfactory levels to get adequate 
measures of achievement, and that the data here, 
therefore, constitute anacceptab le substitute for 
random sampling of such areas ineastern Kentucky. 


Procedure of the Study 


The Stanford Achievement tests were used in 
measuring levels of achievement in seventh and 
eighth grades in each of the sc hooldistricts. In 
District A (mining area), measures were taken in 
reading, spelling, language, and arithmetic.” The 
same measures were taken in District B (the rural 
area) plus measures in science and social science. 
In all, some 1, 150 students іп seve nth grade and 
1/000 in eighth grade took the tests in District A, 


“and 300 in seventh grade and 250 in eighth grade іп 


District B. ji с 
The elementary Schools in each district were 


_ classified by number enrolled into three gore 


GroupI - 300 or more. 

Group П - More than 100, less than 300. 

Group Ш- Less than 100. 

To give the reader some idea of the size of the 
schools, we report here the total enrollments ang 
numbers of the schools in each group. In Dis tric 
A there were 4,659 students enrolled in eleyen 
schools of enrollment greater than 300, 3, 524 in 
20 schools of enrollment less than 300 and greater 
than 100, and 1,530 in 30schools of enrollment less 
than 100. 

In District B there were 836 students enrolled 
in one school of enrollment greater than 300,520 in 
three schools of enrollment less than 300 and great- 
er than 100, and 1,301 in 52 schools of enrollment 
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less than 100. !* 

It may be pointed out that District B furnished a 
greater number of one-room schools and a larger 
number of seventh and eighth-grade students in 
Group III schools than did District А. Of the 30 
District А schools in Group III, twelve were опе- 
teacher schools, whereas 47 of the 52 in Group III 
in District B were one-room units. The frequen- 
cies of those who took the testsare indicated in Ta- 
bles I - Ш. » ж” 

Mean scores, variances, у; (skewness), and у: 
(kurtosis) were calculatedfor each ofthe two grade 
levels, on each of the tests, for each school district 
Separately, and for each of thethree groups separ- 
ately. 

Upon examination of these moments it was quite 
obvious that the assumption that the statistics Хі- 
Xj (i=1, II; j - II, Ш; i<j) (the differences between 
group means) follow a normal distribution was ap- 
propriate. This assumption permittedthe construc- 
tion of 95% and 99% confidence interval estimates 
of the real differences between population means, 

Hai 7 Hj. 

The бо confidence limits are reported in Tables 
I, П, and M. For example, in Table I, eighth 
grade reading, Хү - Хр = .83and 2.57 Ky - Xm = 
‚32. This is written as . 83+ .32. The 99% confi- 
dence interval estimate of ШІ - uris given by 

.83-.32€ up- ug <.83+.32ог 

91€ up-ug <1.15 
Since the interval does not include 0, the null hy- 
pothesis (uj - итү = 0) is rejected at the . 01 level 

of significance as is indicated by the .01 immedi- 
ately to the right of . 83 +.32. АП computations 
were made with the IBM 650 in the Computing Cen- 
ter of the University of Kentueky. 

The results for District А may be obser ved in 
Table І. This table shows a consistent pattern of 
evidence that those in Group I out-performed those 
in Groups II and III, and that those in Group II out- 
performed those in Group Ш. That is to say: In 
District A, in 24 comparisons of the arithmetic 
means of the scores of students in larger schools to 
those of students in smallerschools ; the larger 
Schools had the advantage in every instance. 

In District B (Table II), in 36 comparisons of the 
arithmetic means of the scores of students in larger 
Schools to those of students in smaller Schools, the 
larger schools had the advantage in 32 instances. 
Indeed, Group I out-performed both the other groups 
in all 24 comparisons to them. Furthe r, it does 
not Seem unreasonable to Suspect that, since these 
exceptions were in comparisons between schools 
classified in immediately proximate intervals, and 
also because the differences in favor of the smaller 
Schools in each instance were small and not statis- 
tically significant at the . 05 level, these variations 
in the pattern may be a consequence of chance fac- 
tors. ы 


PENE TEE ше сыз = МА. 7. 1:7 ioe 
* Footnotes will be found at the end of the article. 


Regarding the patterns of the two districts from 
the viewpoint of specific probabilities: 

In District А, the students inthe largest schools 
(Group I) demonstrated a real superiority over 
those in the second-largest schools (Group II) on 
all four of the tests in both grades ata significance 
level of .01, thus rejecting the null hypothesis in 
eight out of eight examples. Incomparison between 
the largest (Group I) and the smallest (Group III) 
Schools, the hypothesis was similarly rejected in 
Six of the eight examples, and rejected at the . 05 
level of significance in the other two. Six of the 
examples of the eight obtained by comparing scores 
in Group II to those of Group ІШ, though they tend to 
reject the hypothesis, since they do indicate a like- 
ly difference in favor of the larger Schools, do not 
do so at an acceptable (. 05) level of significance. 
The other two of the eight do indicate differences 
Statistically defensible as real at the . 01 level of 
Significance. 

The same general pattern, though not quite so 
clearly pronounced, is represented in the evidence 
from District B, with the exception of the four re- 
versals to be noted in comparisons between Groups 
П апа Ш. Again, all differences between scores 
for Group I and the other twogroups аге in favor of 
the largest schools. Out of 12 co mparisons be- 
tween Groups I and II, the differences infavor of 
Group I are in eight examples defensible at the . 01 
level of significance. Also, considering differen- 
ces between Groups П andIII, eleven of the twelve 
are defensible at the .01 level, andthe other at 
-05 level. It is true that four of the examples of 
comparisons between Groups П and III actually re- 
versed the pattern, though notatanacceptable (.05) 
levelof significance. That is, though the smallest 
Schools did actually out-perform the larger (mid- 
dlesize) schools in the examples, they did not do so 
on a statistically acceptable basis that the differen- 
ces in their favor at any instance mightbe accepted 
as “теа1,” even at an 05 level of Significance. 


The Two Districts Combined 
The Two Districts Combined. 


There were important differences between Dis- 
trict A and District В. Specifically, there was a 
generally consistent difference in the achievement 
levels of students as measured by the tests. Тһе 
investigators hesitated, therefore, to combine the 
data and treat the two as one district, although the 
original expectation had been to do sosince the 
effect would be a gratifying increase in frequencies 
for each category. 

The combination of data from both districts, in 
the four subjects in which tests Were given in both 
districts, effected Table Ш. The diff erences in 
level of performance generally between students in 
the two districts produced a “stale mate” in the 
issue over differences between Group II and Group 


TABLE I 
COMPARISON OF DIFFERENCE IN MEAN SCORES OF STUDENTS IN 
SCHOOLS OF DIFFERENT SIZES IN DISTRICT A 


Number of Scores Differences of Mean Scores Among Three Groups* 
Group I Group П Group III Group I-- Signif. Group I-- Signif. Group II-- Signif. 


2300 (<300, >100) (<100) Group П Level** Group II Level** Group III Level** 
ыр Level** Group П Level** Group Ш Level+* 


Seventh Grade 


"Reading 592 409 108 260% cab SUL 86 + .44 101%, 226 +1738 >.05 
Spelling 601 407 110 23105527 01 .39 t .45 .05 08 t .44 >. 05 
Language 601 408 110 .57 + .42 .01 1.24 t .70 .01 .67 t.63 .01 
Arithmetic 591 410 107 .49 1.22  .01 210 $5537 .01 .21%.34 2.05 

* 
Eighth Grade 

^ Reading*** 526 378 72 :B3* .32. ..01 1:111 62 201 -72875;57 2.05 
Spelling 533 378 71 SC tute AU OL apart ба 405 .04%.59 2.05 
Language 533 377 71 174% .48  .01 1.81% ,90 .01 1:07 + .86 01 
Arithmetic*eek 521 372 68 боов OL -96 * .56 :01 .30%.50 >.05 


*In school years of achievement. 
**That is, level at which differences between means are statistically significant. 
***Composite of "paragraph meaning" and "word meaning" scores. 
жок Composite of "computation" and "skills" scores. 
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TABLE II 


COMPARISON OF DIFFERENCE IN MEAN SCORES OF STUDENTS IN 
SCHOOLS OF DIFFERENT SIZES IN DISTRICT B 


Number of Scores Differences of Mean Scores Among Three Groups* 
GroupI Group II Group Ш Group I-- Signif. Group I-- Signif. Group П-- Signif. 
" (2300) (<300,>100) (<100) Group II Level** Group Ш Level** Group Ш Level** 


Seventh: Grade: 
Reading 105 69 130 1.137 .65 .01 .87*.58  .01 -.26+.57 


>.05 
Spelling 104 69 129 273 13058.01 Ae EEST 208. 22727 Бох o OR 
Language 106 69 130 1.49 +113 .01 1.54*.95  .01 .05 11.06 2.05 
Arithmetic 107 69 130 .67t.58 .01 «51 t .47 .01 -.16+.50 >.05 
Science 105 69 131 .98t.72 .01 .99 t .61 .01 .01t.67 2.05 
Social Studies 105 69 133 .90 1.58 .01 .57T t 53 -01 -.33 t.53 2.05 


Eighth Grade 


Reading 69 51 121 1.09 + .98 .01 1522. 101 .43t.64 >.05 
Spelling 69 51 121 .68 t .94 >05 1.46t.72 .01 .78 + .77 .01 
Language 69 51 121 .58 t1.46 >,05 2.11%114 .01 1.53 21.16 101 
Arithmetic 69 51 120 .51%.86 >05 1.12t.64 .01 vir ii e 563 .05 
Science 69 51 121 .57 7112 x05 1.56 + ,81 .01 .99 + „83 .01 
Social Science 69 51 121 .96+.95 .01 1.04%.67 .01 .08*.61 >.05 
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TABLE III 


COMPARISON OF DIFFERENCE IN MEAN SCORES OF STUDENTS IN 
SCHOOLS OF DIFFERENT SIZES IN DISTRICTS A AND B 


Number of Scores Differences of Mean Scores Among Three Groups* 
Groupl Group II Group III Group I-- Signif. Group I-- Signif. Group II-- Signif. 
(2300) (<300, >100) (<300) Group П Level** Group II Level** Group II Level** 


Seventh Grade 


Reading 697 418 238 .68*.24 .01 5497322833. 70] -.19t.29 >.05 
Spelling 705 476 239 438 * 725 01 ОТАВА 05/8524: 0.732 4 205 
Language 707 477 240 .71t.40 .01 18t.53 . .01 .07t.50 >.05 
Arithmetic 698 419 237 252 t hel C Ol 232717720) 01 -.20%.26 >.05 
Eighth Grade 
Reading 595 429 193 586 1..31: $01 .98 + .41 .01 .12%%37 2.05 
Spelling 602 429 192 .51t.31 .01 .46 t .41 .01 -.05t.40 2.05 
Language 602 427 192 .20* 46 .01 1232 072617 5.01 4623.59: Б 01 
Arithmetic 590 423 188 .64t .27 .01 .60 * .36 .01 -.04%,34 >,05 
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II. That is to say, the students in Group Ш in one 
district generally equaled or out-performed those 
іп Group П in the other, thoughthey did notdo so in 
their own district. 

This suggests an important obser va tion: The 
generalization which, at leastto some exte nt, 
emerges from this study—namely, that students in 
larger Schools tend to out-perform those in small- 
er—should be limited in application toa single 
school district (or at least homogeneous districts), 
at one time. It seems reasonable to assume that a 
small school in one district might out- perform a 
large school in another. The combination of Dis- 
tricts A and B here, however, did strengthen the 
evidence that the differences instudent performance 
between schools of greater than 300 enrolled and 
those of less is real—at leastfor the examp les 
given. That is, when the differences in size of 
School were great enough, they overcame other var- 
iables affecting student performance. Indeed, іп 15 
out of 16 examples of comparisons of Group I tothe 
other two groups, the differences in means are es- 
tablished as real at the . 01 level of significance. 
The differences in favor of Group II over Group III 
are established at a significance level of .01 only 
in one instance (language), but none of the differen- 
ces in favor of III over II are demonstrated even at 
a significance level of . 05. 


Conclusions 


The evidence of this study points towarda strong 
likelihood that students inlargerschools (above 300 


enrollment?) tend to out-perform students in smal- 
ler schools, in the same (or perhaps com parable) 
districts. The evidence does not reject the possi- 
bility that factors other than size influence the dif- 
ferences in levels of achievement of students. It 
does imply that as schools of such different sizes 
actually are, with whatever differences in addition 
to size actually exist in them, the larger schools 
appear to produce higher achievement levels among 
students. It would be incautious for the investiga- 
tors here to generalize beyond the limits of the 
study in its restriction to rural and mining areas of 
Kentucky, schools of less than 900 enrollment, and 
achievement in the subject areas in which the tests 
were given. On the other hand, they have consider- 
able confidence that, in the absence of evidence to 
the contrary, the findings here have application to 
the issues which grow out of any debate over the 
value of the small versuslarge schoolas such 
schools are actually equipped, staffed, and operat- 
ed in general. 


FOOTNOTES 


1. Forty-seven of these were one-room schools, 
enrollments running as low as four students in 
one school. 

2. Since the largest school included in the study 
enrolled only 836 students, it should not be 
concluded here that there is no limit on how 
large a school should be. Indeed, even the 
**Jarge''ones here may be regardedas “тай” 
by many standards. 
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How Good |5 a Prescribed Mnemonic Device 


in Learning Textbook Content? 


“Thirty days hath September 
April, June and October, 
АП the rest have peanut brittle, 
Except Grandma 
And she rides around on a little red tricycle. ’’ 


THIS OLD saying burlesques the popular mne- 
monic device for learning the lengths of the months. 
Mnemonic devices are known to most people and 
used by many. Are they helpful to learning, or are 
they harmful? 

The dictionary describes mnemonic devices 
somewhat skeptically: '*Mnemonics: Theartthat as- 
sists, or attempts to assist, memory” (Merriam 
Webster). Mitchell’s Enc yclopedia Britannica ar- 
ticle (9) gives a good introductionto mnemonics and 
mnemonic devices. A mnemonic device is usually a 
verbal statement or sequence. Each word in the 
statement stands for alarger block of the content to 
be learned. There are also geometric mnemonic 
devices, such as a pattern of the eight corners of a 
lecture hall, each to remind the lecturer of a part 
of his speech. 

This experiment aimed to help answer the ques- 
tion of whether mnemonics helped in learning. The 
subjects were given a learning and memory task to 
accomplish. Part of them were told to use a mne- 
monic scheme. The others were asked to learn 
without the scheme. 

The author presented the experiment to the stu- 
dent subjects as one which would illustrate an appli- 
cation of scientific method. The group using the 
mnemonic scheme formed an experimental popula- 
tion. The other students, who learned without the 
mnemonic device, were a control population. 

The students were enrolled in a basic education- 
al psychology course. This was, for them, a first 
course in education. About а fifth of the class had 
had some teaching experience. Womenslightly out- 
numbered the men in the course. 

The members of the class were assigned to the 
experimental or to the control group according to 
whether they had odd or even seat numbers. The 
reasons for this assignment were: 

1) It was a quick and readily under standable 

method. 


SIDNEY ADAMS 
Arlington, Virginia 


2) There was no important preference or bias in 

the students having odd or even chairs. 

3) There were approximately equal numbers in 

the two groups. 

Both groups learned the same material. The ex- 
perimental group learned with the help of the same 
assigned mnemonic aid. The control group learned 
the same material without a mnemonic device. The 
device was not given to the control students. 


Instructions to the Subjects 


Both groups were told to learn Chapter 16 of 
Guthrie and Powers ‘‘Educational Psychology’’ (5). 
This chapter covered transfer of training. At the 
time of the experiment, the chapter was not sched- 
uled for study until several days in the future. The 
class was told to study the chapter in detail. They 
were told to learn, as well as they could, names, 
facts, the meaning and implication of and generali- 
zations on the transfer of training. 

Both groups were urged to learn key words of the 
bold-faced type paragraph headings of the chapter. 
These headings were to serve as a framework for 
remembering the chapter contents. The key words 
were: 


Problems Related Factors 
Viewpoints Subject Matter 
History Attitudes 
Definitions Teacher 


The mnemonics (experimental) students were 
given in their dittoed handouts a sentence “Prickly 
vines have decidedly rugged spines and thorns." 
The initials of the words in the mnemonic sentence 
were in the same order as the initials of the para- 
graph topics. The experimental group 8 tudents 
were told to learn the sentence, to use the sentence 
as a help in learning the key words of the paragraph 
titles, and the paragraph titles as an aid in learning 
the chapter contents. 


Past Use of Mnemonics by the Subjects 


The students were asked whether they used mne- 
monic devices often, rarely, or never. Their re- 
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plies are shown in Table I. Further comparison of 
the experimental and control s ubjects in Table II 
Shows the experimental group to be largely male, 
and the control group largely female. This table 
also shows that, in final performance in the educa- 
tional psychology course, the two populations were 
equal. The course grade was determined by two 
objective tests of the same format (multiple choice) 
as the experimental test. None of the questions in 
the experimental test appeared in either of the 
course examinations. к 


What the Subjects Studied 


The contents of Chapter 16,which the students 
were asked to learn, are indicated by the headings 
of the paragraphs of the chapter: 

Some Problems of Transfer 
Present Point of View on Transfer 
Historical Background 

Definition of Transfer 

Factors Related to Transfer 
Transfer of Subject Matter 
Transfer of Attitudes 

Transfer of Training and Teaching 

The first part of the chapter states problems in 
transfer of training, and cites experiments and re- 
sults from a considerable number of authors. Тһе 
Second part of the chapter draws со nclusions, and 
makes recommendations as to the best way of get- 
ting desired transfer of training. 


Experimental Testing Procedure 


The instruction sheets for the experimental and 
for the control subjects were passed out in immedi - 
ate succession at the end of a class period. The 
Class met in the early afternoon. The sheets were 
taken up at the beginning of the class onthe next af- 
ternoon. In the latter part of this second class pe- 
riod, the students were tested. They were told that 
the test would not count on their course grade, but 
that the content would probably be covered in later 
examinations. 

The experimental test was made up of 21 four- 
choice multiple-choice questions. The students in- 
dicated their choice by writing the letter of choice 
after the question. There was no separate answer 
sheet. All the participants were able to cover the 
entire test. 


Results 
How effective was the mnemonic sentence in 


helping the students in the experimental population 
to learn the chapter content? Table Ш shows the 


outcome. : 

The difference between the two populationsis. 
microscopic and of course negligible. The use of 
the mnemonic device for learning the outline did not 
help in learning the content of the chapter. * 

In studying a text, the student should learn sig- 
nificant facts, connect the facts with each other, and | 
draw conclusions, Mnemonics did not help to do . 
this, in this experiment. perhaps this was expect- 
ing too much of a simple device. For such help, 
the mnemonic device would not only have to aid the 
Student in learning the bare outline, but would have 
to further assume that the outline would be very im- 
portant in the learning of the text itself. Mnemon- 
ic devices have been most used in learning lists, 
outlines, bare-boned rather than rich content. 

Did the mnemonic device help the students learn 
the bare outline? Nearly all the students lear ned 
the outline perfectly. The compar ison of the two 
populations is shown in Table IV. 

There is certainly little to choose between the 
two groups. The experimental population had in- 
significantly more perfect recalls, and an insignif- 
icantly smaller average recall than did the control 
population. The one-sentence mnemonic device did 
not help the students in either meaningful or rote 
learning. 


Kinds of Mnemonics Used by the Students 


It has already been pointed out (in Table I) that 
most of the students reported using mnemonic de- 
vices. АП were asked:to give examples of mne- 
monic devices they knew or used. Some of the de- 
vices were specific. Others were generalized aids 
to learning. A listing of general devices from the 
students' returns included: 

Alphabetizing when learning a long list. 

Robert Hutt system (events or some other se- 
ries as ‘‘mental hooks"! on which to hang 
long lists to be memorized). 

Building sentences from key words. 

Making rhythms and rhymes. 

Notice similar beginnings and endings іп 
words to be learned. 

Use the first letters, in order, of a list to be 
learned, as the first letters of the words of 
the memory aid sentence. (This is the ex- 
perimental method of this study. ) 

Learn the first letters of a list. 

Arrange the list in chronological order, 

Thus, most methods seem to d ep end on forcing 
the material to be learned into a meaningful or half- 
meaningful order. The method used inthis experi- 
ment seems consistent with the methods which the 
Students already practiced. 


* The students with imperfect recall of the chapter topics showed the following performance on their test for 
Chapter 16, as compared with the students who had perfect recall of the topics: 12. 7 right answers against 
13.6. Тһе difference is small but in the expected direction. 


Study Time 


The students were asked for their estimates of 
the study time which they gave over to learning the 
chapter. The reported times ranged from none at 
all (for one experimental and one control subject) to 
more than three hours. The averages andvariabil- 


ities appear below: Mean S.D. 


Experimental Population 71.2 min. 35.7 min. 


Control Population 63.4 min. 28.6 min. 

There was a wide variation inthe reported times. 
The experimental group, using the added step of 
learning the mnemonic sentence, claim to have put 
in actually less overall study time than did the con- 
trol group. However, the difference is not signifi- 
cant statistically. The sigma of the difference be- 
tween the means is more than 9— in excess of the 
difference itself. For this study, it may be con- 
cluded that the sentence mnemonic device used did 
not help or interfere with learning. 


Rationale of Mnemonic Devices 


Learning by means of a mnemonic d ev ice is de- 
scribed by Mitchell (9) as aiming to ‘‘enable the 
mind to reproduce an unfamiliar idea, and especial- 
ly a series of dissociated ideas, by connecting them 
in some artificial whole, the parts of which are mu- 
tually suggestive". Mnemonic devices may be ap- 
plied to meaningful learning. Here presumably the 
mnemonic device may be discarded and forgotten as 
Soon as the learned content has been meaningfully 
organized. But, as Mitchell’s statement suggests, 
mnemonic devices particularly suggest themselves 
for forced rote learning. Often the device and the 
material learned would be discarded together when 
the need for both had passed. 

An explanation for mnemonic devices can readily 
be found in the psychology of Herbart. A mnemonic 
jingle or other device provides a structuredcore for 
the apperception mass of the learned material. 
Certainly Herbart lived in agreatage of mnemonics. 


Origins of Mnemonic Devices 


Indeed, however, mnemonics flourished through- 
out most of the literate past. We can calla roll of 
practitioners of mnemonics: Aristotle, Plato, Ro- 
ger Bacon Raymond Lully, Petrus di Ravenna, 
Lambert Schenkel, Della Porta, Stanislaus Munk 
von Winckelmann, HermanKethe, andEdmund Pick, 
down to Pelman, whose system has been followed 
in our own lifetimes. While learning by mnemonics 
is contrasted with logical learning, a long-estab- 
lished use of mnemonics has been in teaching the 
“figures” of logic. 
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Mnemonics in Modern Psychology 


There is little about mnemonics in modern psy- 
chological literature. Most of that little is hostile. 
Harriman’s Dictionary (6) has an article ‘‘mnemon- 
ic devices” which is unfavorable. McGeoch and 
Irion (8) are against artificial mnemonic schemes, 
but think that associations adapted to the learner 
and natural for the learned content may be helpful. 
(p. 478) Balaban (1) favored meaningful sequences 
which the learner could seek out— these would be 
better than rote learning. From a past generation, 
Binet (2) seemed favorable. He favored tying in 
mnemonic words with the initial letters of words to 
be learned. Reed (10) in studying associative aids 
for connecting Latin and English word pairs, found 
that the aids were dropped out as practice continu- 
ed. This is a crucial point in determining the use- 
fulness of mnemonics. Gamble (3) found leisurely 
study desirable for the best use of mnemonic de- 
vices. Guilford (4) seems unfavorable to artificial 
mnemonics, He regards the form which guides 
learning as emerging by degrees during the learing 
process. Heidbreder (7) gives examples of mne- 
monic devices which her subjects used in concept 
formation experiments. 


Mnemonics in Use 


Examination of a small number of textbooks 
showed a near-absence of mnemonics. A curr ent 
chemistry text used P C S Mazintltogive the des- 
cending order of electrochemicalactivity of the 
metals from potassium to lead. The imaginary Ma- 
zintl is a character in mnemonic lore like St. Wap- 
niacl, in which the mnemonic system ceases alto- 
gether to have system. St. Wapniaclfor years pre- 
sumed to aid in the recall of the U.S. cabinet of- 
fices. He did not survive the defense reorganiza- 
tion or the rise of the Department of Health, Educa- 
tion and Welfare— nor has he been replaced by any 
new shadow character. Among those seeking new 
and unconventional approaches to problem solving, 
as the operations analysts, the old method of mne- 
monics may be revived. 


Summary 


In a study of college students learning meaning- 
ful materials, with or withouta prescribed sentence 
associated with textbook topics, the mnemonic de- 
vice did not appear to help either rote or meaningful 
learning. On the other hand, the device did not ap- 
pear to increase the time spent on learning. 
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FREQUENCY OF NORMAL USE OF MNEMONICS BY THE SUBJECTS 


——— O 


Mnemonics Used ^ Experimental Group 


Control Group 


N % N % N % 
солид ЕСЕГЕ a MEICECU idet io ien 
Often 4 18.2 31.0 13 25.5 
Rarely 9 40.9 44.8 22 43.1 
Never 9 40.9 24.1 16 31.4 


TABLE II 


SEX AND COURSE GRADE OF SUBJECTS 


se 


Experimental Group 


Control Group 


— eee ет! 


Меп 13 
Women 9 
Total 22 
Course Grade Mean 176.6 


SD 9.3 


10 23 
19 28 
29 51 
76.9 

6.7 
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TABLE III 


MASTERY OF LEARNING TASK BY THE SUBJECTS—CORRECT 
ANSWERS ON CHAPTER CONTENT 


luus EE CÀOILNLJOJszzubeuys AAAGLLLLó kt 


Experimental Group Control Group 
Еа ЕОС сол ыз i 
N 22 29 
Mean 13.59 13.55 
SD 3.23 3.05 

TABLE IV 


MEMORY OF PARAGRAPH HEADINGS BY THE SUBJECTS 


Experimental Group Control Group 
N 22 29 
Number Recalling all Headings 18 23 
Percent Recalling all Headings 18.8 79.3 
Mean Number of Headings Recalled 7.21 7.45 
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S Field News 


HANDWRITING RESEARCH. .... 

An Invitational Conference on Research in 
Handwriting was sponsored by the Committee on Research in Basic Skills at the Uni- 
versity of Wisconsin. Dr. Virgil Herrick reported that in a random national survey he 
found that only one school in five attempts to individualize instruction in handwriting, 
and only about seven percent of the schools had diagnostic and remedial programs in 
handwriting. Elaine M. Templin conducted a survey of 1946 public high-school gradu- 
ates in twenty communities on the eastern seaboard. She found that the pencil, ball- 
point pen, and fountain pen—in that order—are the most commonly used tools of writ- 
ing. The most common uses of handwriting were: making out checks, social corres- 
pondence, filling informs, preparing Shopping lists, and signing or initialling forms 
or letters. Women averaged more handwriting than men. 


NATIONAL MERIT SCHOLARSHIP CORPORATION TO STUDY TALENTED STUDENTS..... 
Robert C. Nichols, former Assistant Profes- 
Sor of Psychology at Purdue University, has been appointed by the Research Division 
of the National Merit Scholarship Corporation to study the personality development of 
academically talented students, the mental health of the talented, and the effects of dif- 
ferent types of colleges on different kinds of students. 


HANDBOOK ON RESEARCH IN TEACHING. .... 
It is expected that the new book, Handbook on 


Research in Teaching, edited by Dr. N. L. Gage, will be published in the summer 
of 1962. 


REVIEW OF EDUCATIONAL RESEARCH..... 
Dr. William B. Michael, Director of the Test- 


ing Bureau and Professor of Psychology and Education at the University of Southern 
California, will succeed Dr. David Krathwohl as editor of the Review of Educational 
Research. 


FEDERALLY FINANCED RESEARCH PROGRAMS..... 
One of the conclusions reached in a Brookings 


Institution study, The Role of the Federal Government in Financing Higher Education 
by Alice M. Rivlin, is that research programs financed by the Federal Government 
should be continued. The Federal Government, however, should assume a larger share 
of the costs of higher education with greater concern for the teaching function than for 


earch. 
ge Continued on page 278,..... 
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Decision-Making in Speech Activities 


ALL HIGH SCHOOLS have speech activities, ei- 
ther curricularized or as part of the extracurricu- 
lum. Some teachers may tend to seek excellence 
inthis area as a valued quality in itself, and ignore 
the fact that speech activities are merely vehicles 
to produce the educated individual. 

It is inconsistent to assume that students educat- 
ed in an autocratic school will later become demo- 
cratic in their adult living. People become demo- 
cratic in action and in spirit only through practicing 
democracy. An essential characteristic ofa demo- 
cratic society is that all involved in a decision shall 
take responsibility for making that decision. 

Ап attempt is made to determine the extent to 
which high school students themselves believe they 
influence the management of Speech activities. 
This provides some clues as to the school's effec- 
tiveness in educating for democratic living. 

An instrument 2 of 41 items was submitted to all 
the seniors in a sample of 30 Minnesota schools. 
The number of control practices observed by each 
Student is his score. The school score is the mean 
Score of all the seniors, both the participants and 
the non-participants in speech activities. 

The enrollment in these senior high schools 
varies from 49 to 900, with graduating classes of 
nine to 234 students. The girls outnumber the boys 
ineach school. The meanschool has an enrollment 
of 174 students with $2,530 of taxable wealth behind 
each resident student in the district. Тһе districts 

„Spend from $242 to $425 toeducate each high 
School student, or a yearly average of $345. 


Participation in Speech Activities 


In the typical school, 45% of the boys and 64% of 
the girls are in speech work. In one ofthe 30 
Schools, every senior reports such participation, 
While in two others, such extensive provisions are 
made for all the girls only. 

School size is a potent indicator of the proportion 
of students in speech activities. Іп the small 
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Schools almost every senior reports such participa- 
tion. 

There is a significant correlation between the 
proportion of students inspeech work and the wealth 
of the district.* Wealth is not an independent var- 
iable because holding either school size or the ex- 
penditure level constant causes the wealth-partici- 
pation correlation to be no longer significant. 

High expenditure schools provide more oppor- 
tunities for speech work. This is partially explain- 
ed by the higher pupil costs in the small schools. 
When size is held constant, there is norelationship 
between extent of participation in speech activities 
and the school expenditure level. 


Speech Activity Scores 


The highest possible score on the instrument is 
41. The best score obtained by any school is 19.5, 
three times as high as the lowest school score of 
6.4. The difference is significant, indicating a real 
difference in the ways schools operate their speech 
activities. 

Boys are more critical of speech activities than 
are girls. In one school the boys influence speech 
activities in 3. 9 ways, whilein another they report 
19.1 practices. Girls claim significantly more 
control opportunities, ranging from 7.9 to 21.9 
practices in the extreme schools. 

Speech activities are greatly influenced by the 
participants in them. In the lowestschools the par- 
ticipants average 11. 6 practices, while in another 
they claim 23.6, or a mean of 16.77 for all schools. 
In three schools none of the non-participants in 
speech observed any of the practices on the instru- 
ment, while the non-participants in the highest 
school reported 14.4. This difference between the 
participants and the non-participants inspeech work 
is significant at the . 001 level. 

Small schools and schools in wealthy districts 
allow considerably more freedom to all students to 
operate speech activities. Tablellshows, however, 


1. А copy of the questionnaire may be obtained from the writer. 
Significance is taken at the . 05 level throughout the study. 
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TABLE I 


SPEECH ACTIVITIES CHARACTERISTICS 


——————À MM 


Schools in Order Percentage of Students Speech Activities Annual Speech Costs 
of Size Participating Mean Scores Per Pupil Participating Per Secondary Student 
A 35% 8.9* $21.00 $5.67 
B 46 12.5 4.41 1.85 
с 24 9.8 6.58 1.11 
D 31 6. 4* 5.25 1.10 
E 36 13.6 5.00 1.41 
Е 40 9. 0% 5.38 1.81 
G 36 14.2** 12.08 3.86 
H 44 7.2 5.19 2.05 
І 43 9. 0% 12.12 4.57 
J 60 11.6 2.47 1.15 
K 44 9.2* 2.94 .88 
L 43 9.7 19.46 7.84 
M 48 9. 6* 13. 43 6.77 
N 36 9.3 2.78 .81 
о 49 14. 1** 24.64 13. 93 
P 50 11.7 3.92 1.72 
Q 67 13.2 12.70 7.22 
R 43 9.9 18.59 6.25 
5 71 13.7 11.67 6.14 
т 56 19.5** 10.97 5.42 
U 62 14.2** 8.78 4. 93 
M 95 11.5 19.17 14.38 
WwW 57 13.5 13.33 6.00 
x 76 13.5 1.92 1.07 
Y 59 14-6%ж 68.33 39.21 
2 50 13.9 2.27 1.14 
АА 94 19. 0** 7.98 5.98 
BB 100 14. 7** 3.33 1.82 4 
сс 78 18.3** 23.80 10. 00 
DD 56 11.4 14.22 5.22 
Mean Values 54. 3% 12.22 $12.12 $5. 70 


ж Denotes low scoring school used in item analysis. ** Denotes high scoring school used in item analysis. 
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TABLE I 
CORRELATIONS FOR SPEECH ACTIVITIES 


SSS gS 


Percentage of Mean Speech Scores Speech Activity Costs 
Variables Participation Participants Boys Girls Total Per Per H. S. 


Participant Student 


—_—_—_—_—_— Se ee 


Percentage of Participation 


Boys .74 

Girls .42 

Total . 06 .60 . 09 .23 
Wealth per Resident Pupil .41 .14 .38 .18 .28 
Average Daily Attendance -.51 -.11 -.40 -.07 -.20 

Maintenance Cost per Pupil Unit .49 133 .36 .08 .14 

Speech Costs per Participant .16 .18 .97 
Speech Costs per High School Student .14 .26 


— == —————————»ÀÉÁÁ————— ы а Аы E ee 


An r of .36 is significant at the . 05 level. 


| TABLE Ш 
t-TESTS OF SPEECH ACTIVITY SCORES 


=———————————————Є————————Єк"—Є————Є]ү—үї———Є——————— 
Significance Level Number of Items at this Level Mean Number of Practices Observed by Entire Senior Class 


High Scoring Schools Low Scoring Schools 
. 001 6 3.04 1.39 
.01 13 6. 66 3.58 
.02 7 2.92 1.52 
.05 5 1.16 .62 
.10 5 1.26 78 
Not Significant 5 81 .51 


—— Il ВОЕННИ MM ала сс шы аар Е 
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TABLE IV 


SPEECH ACTIVITIES QUESTIONNAIRE 


—————Є—к"——Є—Є—ЄЄ&—Є&Є—ЄЄ—Є—Є—Є—Є—Є————Є—Є—Є—Є——— 


Check One 
Yes No 


Did you feel responsible for the financial success of this activity? 


Did you ever help to plan programs or schedules for this activity? 

Was activity program free of pressure from non-school people ? 

Have the students planned definite goals for this activity? 

Did you help choose just what this activity would do? 

Was available activity money spent as students wished? 

Did you help decide how the money was to be spent? 

Did you help decide how the money was tobe raised or earned? 

Did you help select the student who handled the speech activity money? 
Did you help decide how surpluses were disposed of at end of the year? 
Was the money spent as the majority wished? 

Were costs so low you could be in any activity you wished? 

Did you have a lot of fun in speech activities ? 

Were prices, admission fees, dues, etc. , recommended by students ? 

Did this group meet fairly regularly? 

Could the financial standing of this activity be quickly determined? 


Did the school generally supervise extracurricular finances ? 


Was student interest the primary requirement for participation? 


Are records kept of participation in activities ? 
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that wealth and school size do not particularly af- 
fect the amount of student control which the partici- 
pants in speech activities claim for themselves. 

Both the participants and the entire student body 
members feel they gain control over Speech activi- 
ties in the higher expenditure schools. When 
Schools exist at a bare minimum leve 1, decision- 
making is not left to students. The costs of Speech 
activities bear a low but positive relationship to the 
number of opportunities given students to influence 
them. 


Speech Activity Costs 


Speech activity funds are derived from many 
Sources, as dues, admissions, and direct school 
board appropriations. In addition, principals pro- 
rated teachers’ salaries on the basis of time in- 
volved in speech activities. 

In the senior year an average of $12. 12 is spent 
for each person in speech, One school reports 
costs of $68.33 per participant, while the lowest 
indicated is $1.92, a range of approximately 34 to 
one. Computed differently, the yearly costs are 
from $39.20 to $. 80 for each high school student, 
with a mean expenditure level of $5. 70. 

The variables previously used, perce ntage of 
Student body participation, district wealth, school 
size, and maintenance level, are not related to 
Speech activity costs per partic ipant nor to costs 
per high school student. What little evidence there 
is suggests that it is more meaningful to use cost 
figures on an enrollment rather thanona partici- 
pant basis. 


Preparation of a Critical Instrument 


The eight highest and the eight lowest schools in 
total speech scores were analyzed. The percent- 
ages of students that observed each of the 41 prac- 
tices in their schools were computed, and the high 
Scoring schools compared, using at-testof the dif- 
ference of means. This procedure, revealed that 31 
of the items discriminated between high and low 
Schools at the . 05 level, several items being sig- 
nificant at the . 001 level. 


The effectiveness of speech activities in educat- 
ing for democratic problem-solving may be judged 
by this questionnaire which includes only items sig- 
nificant at the . 01 level. АП оғ the seniors are 
polled and directed to answer “уев” if they can do 
so for any time during their sophomore, junior, or 
Senior years. Then the mean number of practices 
observed per student is the school score in Speech 
activities. The questionnaire is shown in Table IV. 
А scatterplot of the responses to these 19 items 
Seems to indicate four levels of quality: 


School Scores Quality 
9.7 and up Superior 
17.4- 9.6 Satisfactory 
5.1- 7.3 Marginal 
.0-5.0 Unsatisfactory 
Summary 


An analysis of student responses shows that such 
factors as extensive student partic ipation, small 
School size, and high district wealth and school ex- 
penditure level are conducive to cons iderable stu- 
dent control in speech activities. 
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FIELD NEWS continued... 


EIGHT-YEAR STUDY OF ACADEMIC PREDICTION AND GROW TH. .... 

The College Entrance Examination Board and 
Educational Testing Service jointly are supporting an eight-year study to obtain a clear- 
er understanding of the way in which academic growth of pupils is related to kinds of 
School program, as well as to individual activities, abilities, and sex. In the fall of 
1961 а battery oftests were administered to 40, 000 fifth-, seventh-, ninth-, and 
eleventh-grade students inabout 170 schools. The same students, except for the elev- 
enth graders, will take the same tests at two-year intervals. The battery consists of 
School and College Ability Tests, the Sequential Tests of Educational Progress, and 
Tests of General Information. Dr. Scarvia B. Anderson is the director of the project. 
A longitudinal study of this kind should be a valuable contribution to education. 


QUANTITATIVE RESEARCH LABORATORY ESTABLISHED AT STANFORD..... 

A Laboratory for Quantitative Research has 
been established at Stanford University to conduct research in the following areas: in 
learning theory and its application to educational problems, teaching machines, evalu- 
ation of courses and techniques of teaching, mathematical elements of test construc- 
tion, application of management science to schooladministration, This laboratory will 
be directed by Professor Herbert Solomon. 


RESEARCH IN THE COLORADO SPRINGS PUBLIC SCHOOL SYSTEM..... 
Ina recent study, Ada M. Grant and Roland 
A. Aplin conducted a study to determine if there were significant differences of stu- 
dents’ scores over a one-year interval of the School and College Ability Tests and the 
Sequential Tests of Educational Progress. Approximately 100 cases at four high-school 
grade levels were examined, In general, no differences were found. 


CONANT TEACHER EDUCATION STUDY..... 

In September 1961 James B. Conant officially 
launched a study of the education of teachers. Dr. Conant said that he was encouraged 
to make a study in this controversial area by leaders іп the field. In the first year, 
Dr. Conant plans to limit his inquiries to the preparation of elementary school teach- 
ers and secondary schoolteachers inthe academic subjects. In the second year he will 
study the preparation of other teachers and special service personnel as well as mat- 
ters of certification and accreditation. He plans to visit a limited sample from the 
more than 1100 colleges and universities which prepare teachers. 


OFFICE OF EDUCATION REPORT ON UNDER-ACHIEVERS. .... 

The U. S. Office of Education reports in an 
eighty-five page publication, Guidance for the Under-Achiever with Superior Ability, 
that twenty to thirty percent of the capable students in high schools and colleges of the 
nation do not achieve the full measure of academic success commensurate with their 
abilities. This report presents the observations and conclusions of a team of experts 
invited by the Office of Education to study the under-achiever. Many under-achievers 
were found to have problems relating to mental health, emotional stability, and social 
adjustment. In general, their behavior is considered to be less mature than the aver- 
age of their own age group. They tend to evaluate themselves and others negatively, 
show a higher degree of hostility, and have stronger feelings of inferiority. 


EXPERIMENTAL PROGRAMS IN HIGH SCHOOLS. .... 
A tentative statement of policy for approval, 


direction, and termination of experimental programs in high schools was adopted at the 
Continued on page 282.... 
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Self-Appraisal of Advanced Degree Candidates 


THE CHARAC TERISTICS and abilities conducive 
to success in graduate study, research, and report 
writingare matters of concern, of course, to all 
prospective graduate students as well as to the col- 
lege and university staff members who have respon- 
Sibility for advanced degree work. No measuring 
instrument or combination of instruments has yet 
been devised which can serve as more than a guide 
tograduate counseling, but such guides can be useful, 
and the search continues for better ones. 

This investigation is based on the assumption that 
the students who venture toward graduate work have 
а considerable amount of self-understanding. There 
isnoclaim that this self-understanding is objectively 
measurable. But it is believed that the student's 
Self-concepts, as hefaces advanced studies, can be 
helpful indicators of his probable achievement in the 
graduate school. 


Problem Тһе problem was to discover to what ex- 
tent candidates for post-baccalaureate degrees in 
a graduate school іп ап independent setting con- 
Sideredthat they possessed certain characteristics, 
abilities, and attitudes commonly thought tobe 
valuable to the scholar and research worker. 


Procedure Subjects in the investigation were the 
Students in the 16 sectionsofa research techniques 
Course given in various sessions over a period of 
10 years, from the summer of 1949 to the fall sem- 
ester of 1958-1959. The total number of cases was 
402. Major fields of study represented among these 
Students included education, history, physical ed- 
ucation, psychology, religious education, sociology. 
Use was made of the Score Card for Fitness for 
Graduate Study, developed by C. E. Seashore. This 
blank provides for the scoring of ten t raits, atti- 
tudes, and abilities, as follows:reasoning power, 
originality, memory, alertness, accuracy, applica- 
tion, cooperation, moral attitude, health, andzeal 
for investigation. (See Table I. ) Self- evaluation lies 
along a six-point scale of **very poor, "' “роог,” 
‘low average, ” «high average,” “excellent, " and 
“superior.’? 
. The students were asked by the instructor (this 
Investigator in every case) to appraise their own 


WILLIS N. POTTER 
University of the Pacific 


traits, attitudes, and abilities in comparison with 
those of other prospective graduate candidates whom 
they knew, and with reference also to their under- 
Standing of the importance and values of the items 
listed. It was urged that each student providea 
“гие”? self-evaluation as far as possible, without 
using the judgments of any other individuals. The 
Score sheets were returned unsigned to the instruc- 
tor. 


Treatment and Interpretation of Data Results of 
the self-inventory by the cases were converted 
into simple percentages of the responses appearing 
at the various points along the scale for each item. 
These percentages are shown in Table I. 


When score values from 1 to 6 were assigned to 
eachpoint onthe scale, the total score of each trait, 
attitude, or ability could be computed, and they 
could be ranked with reference to the student’s 
self-evaluations. These scores and the ranking ap- 
pear in Table II. 

Analysis of the data indicated that the advanced 
students gave themselves relatively high ratings 
with respect tosuch items as moral attitude, health, 
and cooperation. In fact, no student rated himself 
below **low average’’ in moral attitude, and in health 
and cooperation only three each checked “ poor?’ 
At the same time, only onthesethree items did any 
considerable percentage of students consider them- 
selves to be *'superior. "' 

Memory, originality, and application held con- 
Spicuously low ranks. The students seldom found 
among their traits a brilliant and creative imagi- 
nation or a facile command of facts. It is significant 
alsothat they generally doubted their powers of con- 
centrated effort. 

When the ten traits were arbitrarily placed in 
four categories, as indicated below, a further ob- 
servation was possible: 


Rank 
Ethical factor: moral attitude I 
Physical factor: health 2 
Volitional and 
emotional factors: cooperation 3 
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TABLE I 


PERCENTAGES OF SELF-RATINGS BY 402 ADVANCED STUDENTS 
ON SCORE CARD FOR FITNESS FOR GRADUATE STUDY 


Traits, Attitudes, and Very Poor Low High Excel- Superior 
Abilities Poor Ave. Ave. lent 
1 2 3 4 5 6 


Reasoning Power: capacity 
for solving problems, both 
deductive and inductive. ‚14 13.18 62.44 22.64 1.00 


Originality: creative imagi- 

nation, brilliancy, planful 

initiative, and fertility of 

rational ideas. 3.73 27.11 46.77 18.91 3.48 


Memory: extensive, logical, 
serviceable, and ready com- 
mand of facts. 1.24 5.22 34. 58 44.03 12.94 1, 99 


Alertness: quick, incisive, 
and responsive observation, 
thought, and feeling. .74 10. 45 57. 98 26.87 3. 98 


Ассигасу: ргесіѕе, Кееп, 
regular, and reliable obser- 
vation, thought, and feeling. 1.74 19.65 58. 96 18.16 1.49 


Application: power of con- 
centration, sustained atten- 
tion, persistence. .25 3.48 24.38 43.28 24.88 3.73 


Cooperation: capacity for 
intellectual companionship, 
teamwork, and leadership. ‚14 6.47 38.31 44. 03 10.45 


Moral attitude: iatellectual 
honesty. wholesome mora! 


standards, ideals, and influ 
ences. 2.74 36. 07 15.52 15.67 


Health: nervous stability, 
physique, vitality, and 


endurance. .74 8.71 28.11 45.27 AULT 
Zeal for Investigation: deep 
interest in and craving for 

.25 2.99 24. 33 43.59 21.39 7.21 


original and creative work. 


____ мшш 5 om M uM о d T 


Form adapted from the score card developed by C. E. Seashore. See The Gifted Student and Research, Twenty- 
fourth Annual Conference of the Association of American Universities (Washington, D.C., National Research 


Council, 1922), pp. 39-48. 


POTTER 


281 


TABLE II 


RANKING OF TRAITS, ATTITUDES, AND ABILITIES IN SELF- RATINGS 
OF 402 ADVANCED STUDENTS ON FITNESS FOR GRADUATE STUDY 


————————Á—Á—À M—— € 
с_—————Є——Є——Є————— 


Traits, Attitudes, and Abilities 


у УУУ _ ы a rs rn 


Moral Attitude 
Health 

Cooperation 
Alertness 

Reasoning Power 
Accuracy 

Zeal for Investigation 
Application 
Originality 


Memory 


zeal for 

investigation 

application 

alertness 

reasoning power 
accuracy 

originality 

memory 1 


Intellectual factors: 


SORT » o ч 


. The students thought themselves to be compara- 
tively well fitted for graduate study and research 
as far as their ethical and physical qualifications 
were concerned. But they seemed to feel consider- 
ably less security in relation to important intellect- 
ual traits and abilities. 


Total Score Rank 
1906 1 
1887 2 
1837 3 
1700 4 
1648 5 
1630 6 
1626 7 
1609 8 
1573 9 
1480 10 


Conclusions The individual student’s recognition 

of areas of strength and weakness in his character- 
istics and abilities as aprospective degree candidate 
would seem to be a profitable experience in a re- 
search techniques course. 

The self-concepts of graduate students can provide 
interesting and useful information for group coun- 
seling purposes. 

Prospective researchers and degree candidates 
often have strong misgivings about certain of their 
qualifications, particularly the intellectual. If they 
can learn to understand themselves clearly, a more 
effective approach can be made to strengthening 
them, and in some cases saving them, as effective 
workers in graduate education. 
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FIELD NEWS  continued.... 


fall meeting of the administrative Committee and State Chairmen of the Commission 
on Secondary Schools, held in Madison, Wisconsin. Elaboration of this policy 
statement may be found in the North Central Association report, Today, November- 
December 1961. This report may be obtained from the North Central Association of 
Colleges and Secondary Schools, 5454 S. Shore Drive, Chicago 15. 


THE RELATIONSHIP BETWEEN SCHOOL GRADES AND AUTOMOBILE USAGE has been studied 
by the Allstate Insurance Companies. Astudy of twenty thousand students in the junior 
and senior grades of twenty-nine high schools in the United States and Canada showed 
that seventy-two percent had learnedto drive and twenty-five percent owned their own 
cars. The drivers contributed sixty-seven percent of the “А” students, sixty-nine 
percent of the *B's", seventy-two percent of the ‘‘C’s’’, seventy-six percent of the 
**D's", and eighty-two percent of the “‘F’s’’. Тһе records of car owners and of those 
who drove to school were somewhat worse. Тһе evidence suggests that poorer stu- 
dents, perhaps because they direct their interests away from school, more likely be- 
come car owners and that after obtaining a car there occurs further deterioration in 
school work. Animportantfactor discovered in this study of the relationship between 
academic performance and car usage wasthat it was not the automobile in itself, nor 
the part-time job often held to support it, but the degree of parental lattitude permit- 
ted in use of the automobile: “Іп comparison after comparisonthe underlying signif- 
icance tends to center about the exercise of restraints and discipline by parents. '' 


SUGGESTIONS FOR EXPERIMENTATION IN PROGRAME D INSTRUCTION are set forth in the 
December 1961 issue of Programed Instruction (available from The Center for Pro- 
gramed Instruction, Inc., 363 West End Avenue, New York 24, N. Y.). The most 
common types of experimentation involve: 1) the comparison of machine and textbook 
presentation of programed materials, and 2) the comparison of programed and con- 
ventional instruction. Тһе Center believes that neither type is practical or useful at 
this time. Instead, the Center recommends three other approaches: case studies, 
field testing, and implementation. 


SOME RECOMMENDATIONS TO EASE THE SHORTAGE OF COLLEGE TEACHERS have been put 
forth by the Fund for the Advancement of Education. After two years of experimen- 
tation, the Fundfoundthat almost every device with which investigators “ехрегішепі- 
ей” seems to work. Large lecture classes, television, non-professional assistants, 
and independent study help considerably in solving ‘‘the impending problem of the 
shortage of college teachers'' (Better Utilizationof College Teaching Resources, New 
York, Fund for the Advancement of Education, 1 


EMPHASIS ON CONTRACT RESEARCH RATHER THAN EDUCATION is a criticism directed to- 
ward some universities by J. Kenneth Little of the University of Wisconsin. Conduct- 
ing a U. S. Office of Education survey of federal programs in higher education, he 
found that at least six major universities received more than fifty percent of their op- 
erating budgets from federal grants during 1958-59. 

A study conducted by the National Opinion Research Center of the University of 
Chicago indicates that *unbalancing" which results from federal programs gives а 
better chance to low ability natural science students entering graduate training than 
superior students in the social sciences and humanities. 


ж Жу ЖЕТ Ж 
Readers are invited to send items of research interest to 


John Schmid, Professor, College of Education, University 
of Arkansas, Fayetteville, Arkansas. 
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An Experimental Effort to Improve Instruction 


Through Visual Feedback 


THOSE WHO engage in the preparation of teach- 
ers have found that the beginning teacher frequent- 
ly appears indifferent to the effect his performance 
is having upon pupils. Failure to note the conse- 
quences of instructional practice andto alter teach- 
ing acts appropriately has been attributed to the 
Student teacher's ‘‘self-consciousness, ’’ ‘‘subject- 
matter centeredness, ’’ ‘‘ritualistic tendencies," 
and the like. Supervising teachers attempt to help 
novices overcome this deficiency by giving emotion- 
al support, making goals explicit, suggesting new 
approaches and cooperatively analyzing progress of 
pupils. 

Although this assistance is generally considered 
Salutary, it is uneconomical of the supervisor’s 
time. It is important, therefore, that attention be 
given to the search for means by which teachers 
can more efficiently (a) recognize the difficulties 
pupils are encountering during instruction and (b) 
introduce variations in their tea c hing on the basis 
of responses by pupils. 


Problem 


The specific hypothesis of this study was that if 
Student teachers were provided with immediate 
knowledge of selected effects of their teaching upon 
pupils during instruction (feedback), then these 
teachers would more rapidly acquire a larger rep- 
ertoire of appropriate teaching behaviors. This hy- 
pothesis is consistent with the theoretical position 
that most of man's behavior is guided by cues that 
һе is, or is not, proceeding toward desired goals. 
The notion, derived from communication theory, 
that a teacher must be guided by continuous feed- 
back so that he modify his delivery, pace, level of 
abstraction, and so forth, isa common one (1). 
Communication theorists believe that the commun- 
ication of required messages is improved by the 
Completion of the circuit between sender and re- 
Ceiver (2). The value of immediacy in acquiring 
knowledge of results has been empirically demon- 
Strated in other learning situations (3). 

Effects noted by the student teacher were those 
Tesponses by pupils which demonstrated the pres- 
ence or absence of the behavioral outcomes sought 
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by instruction. Appropriate teaching procedures 
were indicated by the extent to which the student 
teacher was perceived by his pupils as one who: 

1. finds out which pupils need special help, 

2. helps individual pupils overcome learning 
difficulties, 

3. gives work which pupils believe they need, 

4. recognizes the difficulties in an assignment 
and prepares pupils for the hard parts, 

5. explains things so that the pupils can under- 
stand, 

6. knows when pupils really understand the work, 

7. makes clear what pupils will know and be a- 
ble to do at the end of the semester, 

8. shows pupils how past work is connected to 
work they are doing now and willdo in the 
future, 

9. plans with pupils that which needs to be done 
next in order to continue learning, 

10. arranges teaching conditions so that pupils 
do not feel embarrassed in answering ques- 
tions, 

11. makes the work interesting, 

12. runs a well-plannedand well-organized class, 

13. knows his subject, 

14. checks to see whether the pupil’s classwork 
is correct, 

15. has ready the things he needs for the class- 
work, 

16. gives clear directions, 

17. asks pupils to help in planning new work to 
be undertaken, 

18. gives pupils many chances to show what they 
have learned to do, 

19. gives each pupil a chance to work on some- 
thing of special interest, 

20. moves on to something else when class has 
caught on, doesn't drag out the lesson, 

21. stays with an idea untilthe pupils understand, 
doesn't jump on to something else right away. 


Subjects 


Ten randomly assigned studentteachers who had 
been teaching for a two-week period inthe field of 
English-Social Studies at the junior high level were 
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the subjects. The 307 pupils in the classrooms of 
these teachers provided data regarding the instruc- 
tional procedures used in their respective situa- 
tions. Four ofthe teachers were in ControlGroup 
1, and the others were divided equally into Control 
Group 2 and the Experimental Group. Those іп Con- 
trol Group 2 taught under normalconditions with the 
exception of the administration by the researcher of 
the pre- and post-questionnaires evaluating the be- 
havior of the student teacher. Those іп Control 
Group 1 and in the ExperimentalGroup taught under 
conditions to be described insubsequent paragraphs. 


Procedures 


An “answer light’’ (4) which displayed indepen- 
dently four colored lights was mounted on the desk 
of each pupil in the Experimental Group. Тһе “ап- 
swer light'' was developed to provide a means for 
immediate indication of pupil mastery of the lessons 
at hand. The light was visible only to the teacher 
and was turned on by pupils using number ed push 
buttons. This arrangementafforded privacy for pu- 
pils in responding to questions, lessening peer 
group influence. Directions for the use of the light 
меге given by the supervising teacher who explain- 
ed that, ‘It is being introduced for a short experi- 
ment, ’’ 

Daily, student teachers in the Experimental 
Group and in Control Group 1 prepared a number of 
multiple-choice questions which called for demon- 
stration by all pupils of their ability either to recall, 
identify, compare, contrast, illustrate, predict, or 
apply the major principle or concept being consider- 
ed during the lesson. These questions were pre- 
sented to the class at appropriate intervals during 
each class session for a period of five weeks. An- 
swers to these questions were given inthe experi- 
mental classrooms by pupils using the multi-colored 
answer lights. The student teacher was instructed 
to note the responses and to reteach as necessary 
before continuing the sequence of instruction. Stu- 
dent teachers in Control Group 1 presented their 
questions for answering without benefit of lights. A 
copy of all questions systematically prepared along 
with both the behavioral responses they were de- 
signed to elicit (e.g., recall, identify, apply, etc.) 

and the actual responses brought forth were retain- 
ed for analysis. 

Student teachers were encouraged to present 
questions by giving them orally while pointing to 
numbers 1, 2, 3, or 4 written on the board. It was 
suggested that the questions often be accompanied 
by material which would allow for problem-solving 
and transfer. Problems could be introduced using 
media such as tape-recordings, film strips, opaque 
projectors and flip charts. 

Before the preparation of questions and the in- 
stallation of the answer light, a questionnaire was 
administered to determine the extent to which pupils 
in experimental and control groups perceived the 


student teacher engaging in the previously listed de- 
sirable practices. An identical post-questionnaire 
was administered five weeks after the introduction 
of the experimental conditions. Pupil respondents 
were advised that no one other than the experimen- 
ter would see their individual evaluations of the stu- 
dent teachers and that their opinions would make no 
difference on the student teacher’s grade or standing 
in the University. 

Analysis of the data was interms of mean gain in 
teacher behavior under thesethree conditions: (a) 
systematic provision for both questions and imme- 
diately visible responses from all pupils, (b) sys- 
tematic provision for questions but no special op- 
portunities for recognizing pupil response, (c) 
chance presentation of questions and responses. The 
association between change in teacher behavior and 
the kind of behavior required in test items (recall, 
comparison, etc.) was also sought. The quality of 
the questions was determined by three professional 
judges who rated questions onthe basis of avoidance 
of trivial items and irrelevant cues and provision 
for sequence. Pupils and teachers were invited to 
express their opinions as to the usefulness of the 
answer light. 


Results 


Scores for each studentteacher were determined 
by computing the arithmetic mean of evaluations 
from his pupils at the beginning and end of the ex- 
perimental period. Correlation coefficients (Table 
I) between practices at the beginning and terminal 
periods indicated no significant change with respect 
to the frequency of appropriate teaching behavior 
in either experimental or control groups. 

The post-test ranges inscores assigned by pupils 
to individual teachers ona scale of 1 to 5 were: 
with the Experimental Group 2.20 to 4.80; with Con- 
trol Group One 2. 17 to 4.83; with Control Group 
Two 1.97 to 4.53. 

Pupils, student teachers and supervising teach- 
ers in the experimental classrooms expressed the 
opinion that when the answer light was used, pupils 
felt under obligation to listen more intently than us- 
ual, the teacher knew more about the pupils' learn- 
ing than he could have learned in any other way, and 
students learned more because of this aid than with 
any other device. 

There was nearly a one-to-one correspondence 
between the rank order of the student teacher with 
respect to appropriate teaching behavior and the 
quality of questions he prepared. Those with the 
best questions were mostírequently seen as engag- 
ing in desirable practice. Student teachers report- 
ed difficulty in the preparation of test items, at 
times revealing an inability to conceive situations 
which would call forth that behavior on the part of 
the pupils which was associated with desired out- 
comes. In short, behavioral outcomes were not 
always clear to the student teachers themselves, 
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CORRELATIONS BETWEEN STUDENT TEACHERS’ INSTRUCTIONAL PRACTICES AT 
BEGINNING AND END OF A FIVE-WEEK PERIOD AS REPORTED BY PUPILS 


SS 


Experimental Group 


Control Group 1 


Control Group 2 


. 490 . 459 . 603 
.532 . 462 .672 
. 613 .651 . 887 
.819 
and questions often lacked relevancy and continuity. Summary 


Discussion 


These results direct attention to the classroom 
applicability of the trialanderror concepts of learn- 
ing and the information theories of the Cybernetic- 
ists (5). They also have implications for the prep- 
aration of teachers. 


Within the limitations of this research, we can 
suggest that the kind of information needed for im- 
proved performance by the teacher must be more 
than *timmediate knowledge of results.''Itis not 
enough that the teacher receive simple correct-in- 
Correct messages as feedback. A teacher needs 
Specific information whichrevealsthecrucial un- 
certainties in the pupil’s thinking before the teacher 
can act more appropriately. The bit of information 
Which shows that the student does or does not know 
what is expected of him is important, but the teach- 
er must reduce his own uncertainty further by dif- 
ferentiating particular misconceptions in the minds 
of students. He should also have information which 
States how his own performance can be modified so 
as to achieve correct behavior. During their prep- 
aration, student teachers should be (a) given oppor- 
tunities to practice the diagnosing of difficulties en- 
countered in learning and (b) provided with illustra- 
tions which offer a range of specific acts for deal- 
ing with problems of instruction. In addition, the 
behavioral outcomes sought must be clearly per- 
Ceived by all concerned. 


It is submitted that if feedback to the novice in- 
cludes contiguous (a) indication of the extent to 
Which pupils understand the lesson, (b) know ledge 

: of the obstacles encountered by the pupil in his 
learning and (c) explanation as to how the teaching 
Performance сап be modified in the light of this in- 
formation, there will be improved instruction. 


The five-week performance of three groups of 
Student teachers, each under a different condition, 
was studied through evaluations by pupils. Теасһ- 
ers in the experimental group sy stematically pro- 
vided stimuli and channels for visual feedback which 
indicated results from the teaching perfor mance. 
Teachers in one control group systematically asked 
questions to obtain information as to whether or not 
pupils understood, but had no special channel for 
receiving the information. Teachers in a third 
group taught without experimental arrangements. 
Differences arising from these conditions in terms 
of change in the performance of student teachers 
were not significant. The study seems to have im- 
plications for the preparation of teachers and com- 
munication theory. 
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the subjects. The 307 pupils in the classrooms of 
these teachers provided data regarding the instruc- 
tional procedures used in their respective situa- 
tions. Four of the teachers were in ControlGroup 
1, and the others were divided equally into Control 
Group 2 and the Experimental Group. Those іп Соп- 
trol Group 2 taught under normalconditions with the 
exception of the administration by the researcher of 
the pre- and post-questionnaires evaluating the be- 
havior of the student teacher. Those in Control 
Group 1 and in the Experimental Group taught under 
conditions to be described in subsequent paragraphs. 


Procedures 


An “ʻa ns wer light” (4) which displayed indepen- 
dently four colored lights was mounted on the desk 
of each pupil in the Experimental Group. The “ап- 
swer light" was developed to provide a means for 
immediate indication of pupil mastery of the lessons 
at hand. The light was visible only to the teacher 
and was turned on by pupils using number ed push 
buttons. This arrangementafforded privacy for pu- 
pils in responding to questions, lessening peer 
group influence, Directions for the use of the light 
were given by the supervising teacher who explain- 
ed that, ‘It is being introduced for a short experi- 
ment. ” 

Daily, student teachers in the Experimental 
Group and in Control Group 1 prepared a number of 
multiple-choice questions which called for demon- 
Stration by all pupils of their ability either to recall, 
identify, compare, contrast, illustrate, predict, or 
apply the major principle or concept being consider- 
ed during the lesson. These questions were pre- 
Sented to the class at appropriate intervals duri ng 
each class session for a period of five weeks. An- 
Swers to these questions were given in the experi- 
mental classrooms by pupils using the multi-colored 
answer lights. The student teacher was instructed 
to note the responses and to reteach as necessary 
before continuing the sequence of instruction. Stu- 
dent teachers in Control Group 1 presented their 
questions for answering without benefit of lights. A 
Copy of all questions Systematically prepared along 
with both the behavioral res ponses they were de- 
Signed to elicit (e.g., recall, identify, apply, etc.) 
and the actual responses brought forth were retain- 
ed for analysis. 

Student teachers were enco uraged to present 
questions by giving them orally while pointingto 
numbers 1, 2, 3, or 4 written on the board. It was 
suggested that the questions often be accompanied 
by material which would allow for problem-solving 
and transfer. Problems could be introduced using 
media such as tape-recordings, film Strips, opaque 

projectors and flip charts. 

Before the preparation of questions and the in- 
Stallation of the answer light, a questionnaire was 
administered to determine the extent to which pupils 
in experimental and control groups perceived the 


student teacher engaging in the previously listed de- 
Sirable practices. An identical post-questionnaire 
was administered five weeks after the introduction 
of the experimental conditions. Pupil respondents 
were advised that no one other than the experimen- 
ter would see their individual evaluations of the stu- 
dent teachers and that their opinions would make no 
difference on the studentteacher's grade or standing 
in the University. 

Analysis of the data was interms of mean gain in 
teacher behavior under thesethree conditions: (a) 
Systematic provision for both questions and imme- 
diately visible responses from all pupils, (b) sys- 
tematic provision for questions but no special op- 
portunities for recognizing pupil response, (c) 
chance presentation of questions and responses. The 
association between change in teacher behavior and 
the kind of behavior required in test items (recall, 
comparison, etc.) was also sought. The quality of 
the questions was determined by three professional 
judges who rated questions onthe basis of avoidance 
of trivial items and irrelevant cues and provision 
for sequence. Pupils and teachers were invited to 
express their opinions as to the usefulness of the 
answer light. 


Results 


Scores for each studentteacher were determined 
by computing the arithmetic mean of evaluations 
from his pupils at the beginning and end of the ex- 
perimental period. Correlation coefficients (Table 
I) between practices at the beginning and terminal 
periods indicated no significant change with respect 
to the frequency of appropriate teaching behavior 
in either experimental or control groups. 

The post-test ranges inscores assigned by pupils 
to individual teachers ona scale of 1 to 5 were: 
with the Experimental Group 2. 20 to 4.80; with Con- 
trol Group One 2.17 to 4.83; with Control Group 
Two 1. 97 to 4.53. 

Pupils, student teachers and supervising teach- 
ers in the experimental classrooms expressed the 
opinion that when the answer light was used, pupils 
felt under obligation to listen more intently than us- 
ual, the teacher knew more about the pupils’ learn- 
ing than he could have learned in any other way, and 
students learned more because of this aid than with 
any other device. 

There was nearly a one-to-one cor respondence 
between the rank order of the student teacher with 
respect to appropriate teaching behavior and the 
quality of questions he prepared. Those with the 
best questions were mostfrequently seen as engag- 
ing in desirable practice. Student teachers report- 
ed difficulty in the preparation of test item s,at 
times revealing an inability to conceive situations 
Which would call forth that behavior on the part of 
the pupils which was associated with desired out- 
comes. In short, behavioral outcomes were not 
always clear to the student teachers themselves, 
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CORRELATIONS BETWEEN STUDENT TEACHERS' INSTRUCTIONAL PRACTICES AT 
BEGINNING AND END OF A FIVE-WEEK PERIOD А5 REPORTED BY PUPILS 
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Control Group 2 


Experimental Group Control Group 1 


490 .459 603 

532 . 462 672 

613 .651 887 
819 
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Of the obstacles encountered by the PUP 


and questions often lacked relevancy and continuity. 


Discussion 


These results direct attention to the c lassroom 
applicability of the trialanderror concepts of learn- 
ing and the information theories of the Cybernetic- 
ists (5). They also have implications for the prep- 
aration of teachers. 


Within the limitations of this research, we can 
suggest that the kind of information needed for im- 
proved performance by the teacher must be more 
than “immediate knowledge of results. "' Itis not 
enough that the teacher receive simple correct-in- 
correct messages as feedback. А teacher needs 
Specific information which reveals thec ruc ial un- 
certainties in the pupil’s thinking before the teacher 
can act more appropriately. The bit of information 
which shows that the student does or does not know 
what is expected of him is important, but the teach- 
er must reduce his own uncertainty fur ther by dif- 
ferentiating particular misconceptions in the minds 
of students. He should also have information which 
states how his own performance can be modified so 
as to achieve correct behavior. During their ргер” 
aration, student teachers should be (a) given oppor- 
tunities to practice the diagnosing of difficulties en- 
Countered in learning and (b) provided with illustra- 
tions which offer a range of specific acts for deal- 
ing with problems of instruction. In addition, the 
behavioral outcomes sought must be © learly per- 
Ceived by all concerned. 


It is submitted that if feedback to the novice in- 
cludes contiguous (a) indication of the extent to 
which pupils understand the lesson, (b) know ledge 

il а 
learning and (c) explanation as to how the teac 
performance 9 be modified in the light of this in- 
formation, there will be improved instruction: 


Summary 


The five-week performance of three groups of 
student teachers, each under a different condition, 
was studied through evaluations by pupils. Teach- 
ers in the experimental group sys tematically pro- 
vided stimuli and channels for visual feedback which 
indicated results from the teaching performance. 
Teachers in one control group systematically asked 
questions to obtain information as to whether or not 
pupils understood, but had no special channel for 
receiving the information. Teachers in a third 
group taught without experimental arrangements. 
Differences arising from these conditions in terms 
of change in the performance of student teachers 
were not significant. The study seems to have im- 
plications for the preparation of teachers and com- 


munication theory. 
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Item Analysis and the Measurement of Change" 


THE PROBLEM of item selection is complicated 
by the fact that each itenr will change in difficulty 
by some unknown amount as time elapses. If no ad- 
ditional training is given, the item will probably in- 
crease in difficulty as forgetting proceeds. If ad- 
ditional training is given, as is typically the case 
with students taking the firstsemester of a one-year 
introductory course, some items ma y conceivably 
decrease in difficulty. In either case, the amount 
of change is unknown, and, indeed, sometimes the 
direction of change, making it difficult to apply ra- 
tional techniques of item selection. 

The measurement of retention is particularly 
sensitive to the difficulty of the task or item; if the 
task is too difficult, performance may by uniformly 
low, in which case, differences betweengroups will 
be masked. In laboratory experimental work on 
learning, the savings method inrelearning provides 


the most sensitive measure of retention and has, 


been generally adopted wherever macimalsensitiv- 
ity is desired. 

Several writers (e.g., Cronbachand Warrington 
(1), Guilford (2), Gulliksen (3), and Lord (4) , have 
dealt with the relation between the ability of the 
group tested and the difficulty of the items to be 
used in constructing a sensitive test. Lord (4), for 
example, points out that a test with maximal dis- 
Criminative power for examinees at a given ability 
level should have all items of the same diffic ulty, 
such that half of the examinees at the given ability 
level will pass the item and half will fail it. Ina 
test-retest study designed to measure course reten- 
tion, we may consider that we are comparing test 
Performance at two levels of ability: before andaf- 
ter forgetting. Therefore, knowledge of the dis- 
criminative power of an item in the initial test рго- 
Vides no clear indication of its discriminative power 
in the re-test, unless the amount of forgetting (i.e., 
the new ability level) is known in advance. But, of 
сошве, the amount of forgetting is exactly what we 
are trying to find out. 

This problem arose during an experimental com- 
parison of two methods of teaching general psychol- 
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ogy—a conventional course with three class meet- 
ings per week and a self-directed study course with 
about one class meeting per week. It was desired 
to measure retention of course materials after an 
interval of 15 months, a situation in which, unfor- 
tunately, retention may be expe cted to be low and 
the importance of the sensitivity of the instrument 
looms large. 

Having no basis for a rational selection of items 
at an optimal difficulty or discrimination level, it 
was decided to construct the test in such a way as 
to provide some information for future experiments 
of this kind. This approach had the advantage of 
hedging our bets. By using items of varying but 
known discriminative power and difficulty in the 
pre-test, it appeared moderately certain that some 
of them would be useful in the post-test. 

The present report deals, therefore, with two 
hypotheses: 

1. Items that shift in difficulty toward the opti- 
mal level of difficulty will become more discrimin- 
ating with the passage of time, and items that shift 
away from the optimal level of difficulty will become 
less discriminating. 

2. Ina situation where forgetting occurs, items 
that were initially much easier than ће optimal dif- 
ficulty level will shift toward the optimal level and 
will consequently have greater discriminative pow- 
er in the re-test than items that were initially clos- 
er to the optimum in difficulty. 

The first hypothesis is more general, but has 
the limitation that it cannot serve as the basis for 
item selection until the re-test has been administer- 
ed. The second hypothesis requires an assumption 
about the direction of change in ability in the popu- 
lation being tested, and with that restriction it can 
provide a guide to the selection ofitems for the re- 
test on the basis of initial test results. 


Procedure 


Item selection. Students who had participated in 
an experiment in the teaching of general psychology 


* The research reported herein was performed pursuant to а contract with the United States Office of Educa- 
tion, Department of Health, Education, and Welfare. 
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Were to be re-tested after an interval of 15 months. 
The re-test included an attitude tneasure, a test of 
critical thinking in psychology, and a measure of 
retention of factual material. The problem of item 
analysis concerned only the test of factual know- 
ledge, which was to be constructed of items taken 
verbatim from the finalexamination; it was feasible 
to devote 15 minutes of a one-hour test session to 
this part of the test. 

First, an item analysis of the original test of 43 
items was performed, using the 100 high scorers 
and the 100 low scorers from a total of 521 students 
taking the final examination as the criterion groups. 
Then the phi-coefficient for eachitem was calculat- 
ed from an abac (Guilford (2), p. 431. All items 
with high discriminating power, phi=.40, were se- 
lected for use in the re-test; there were twelve such 
items. From the remaining items, all items were 
eliminated in which the initial proportion of Ss pas- 
sing the item was near chance (too difficult) orin 
which the initial proportion correct in the 100 low 
Scorers was over . 80 (too easy), Eight of the eas- 
iest items with the lowest values of phi were select- 
ed. This procedure yielded a set of 20 items ade- 
quate to test the above hypotheses: the range of val- 
ues of phi was from . 06 to . 74 andthe range of val- 
ues of percent correct was from 34 to 83 percent. 
It should be noted that these values are based on the 
class as a whole. Since our procedure for selecting 
Ss deliberately curtailed the low end of the distribu- 
tion, it was to be expected that the items would be 
Somewhat easier for the sample actually used in the 
re-test, 

Subjects. Since the main point of our study was 
а comparison of retention after two different educa- 
tional treatments, it seemed important to select Ss 
who had learned enough from the course that they 
would not have forgotten all of it. On this basis, it 
was decided to use only Ss who had Scored in the top 
half of the class on the final examination (which was 
а composite of factual and problematic questions). 
Invitations to participate in the Study were sent to a 
random sub-sample of 119 students. With the use 
of telephone follow-ups, 71 participants from the 84 
Students remaining on campus were obtained. 


Results and Discussion 
Ae8u!ts and Discussion 


Since the test was composed of four-alternative 
multiple-choice items, the optimaldifficulty level of 
50 percent of Ss knowingan item is represented by 
62. 5 percent of Ss giving the correctanswer. Using 
the scores of the 71 Ss actually participating in the 
re-test, the initial testand re-testcoefficients were 
calculated for each item. For the Same Ss, the 
percent correct on both tests was calculated for 
each item. The items were then subdivided into two 
groups, those for which the percent correct shifted, 

from initial test to re-test, toward the optimal level 
of 62.5 percent, and those which shifted away from 
this optimum. Table I shows the relation between 


change of difficulty level and change in discrimina- 
tive power. The first hypothesis is supported: i- 
tems for which the change in difficulty was toward 
the optimum generally increased in discriminative 
power; all items for which the change in difficulty 
was away from the optimum lost in discriminative 
power. 

In order to test the second hypothesis, the items 
were rank-ordered on level of difficulty inthe in- 
itial test and divided into two groups: eleven easy 
items with percent correct equal to or greater than 
the median and nine difficult items with percent 
correct less than the median. Tablell shows the 
relation between initial difficulty level and direc- 
rectional change in discriminative power. Although 
the trend is not significant, the second hypothesis 
is supported. The easier items tended to become 
more discriminating while the more difficult items 
tended to become less discriminating. 

Admittedly, these results are merely suggestive. 
This study suffers from severe restrictions both as 
to item sampling and subjectsampling. Ina sense, 
the fact that some systematic trends were found in 
Spite of these restrictions holds promise thata 
more thoroughgoing study of the problems discussed 
here will bear fruit. The theory of item selection 
has been profoundly studied insofar as it concerns 
conditions where ability is assumed to remain con- 
stant. In experimental studies of intellectual de- 
velopment, where the problem is to measure 
change, a new set of psychological assum ptions 
must be imposed on the theory of item selection. 


Summary 


А problem is raised for the theory of item anal- 
ysis: in longitudinal experimental studies, it may 
be desirable to construct tests that will be maxi- 
mally sensitive to change in a bility. If the ability 
of the population changes, items having maximal 
discriminative power in an initial test will be lower 
in discriminative power in subsequent tests. In the 
particular case Considered, the ability measured 
was presumed to decrease as the result of forget- 
ting. Results presented suggestthat item selection 
can be improved by taking into accountthe direction 
of change in ability within the population tested. 
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TABLE I 


RELATION BETWEEN CHANGE IN DIFFICULTY AND CHANGE 
IN DISCRIMINATIVE POWER OF ITEMS 


M ——————— 


Change in Difficulty Change in Discriminative Power 
Phi Increases Phi Decreases 


ly ee cy eR AB re M 


Toward optimum 13 3 


Away from optimum 0 4 
а ier nr ae ripe анык а негі 


Cell entries are item frequencies. 
Chi-square = 6. 058, < . 02, with Yates’ correction. 


TABLE П 
RELATION BETWEEN INITIAL DIFFICULTY AND CHANGE 
IN DISCRIMINATIVE POWER OF ITEMS 


Initial Difficulty Change in Discriminative Power 
Phi Increases Phi Decreases 


11 Easy items 8 3 


9 Difficult items 3 6 
Bee ee Бйз oS ae a UR 


Cell entries are item frequencies. 
Chi-square = 3.104, p< 10 
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Synthesizes 75 doctoral studies to offer 
practical insights for practitioners, and 
fresh theories for further research: 


e What are the criteria of teacher effectiveness? 


e What are the pertinent theories of the 
nature and structure of human abilities? 


€ Have statistical techniques been adequate? 


€ Are there discrepancies between teacher 
potenial and performance? 


e What are the personal and professional 
prerequisites to teacher effectiveness? 


€ Can good teachers be distinguished from 
poor teachers? 


This milestone monograph presents a 
critical overview of 30 years of important 
research into teacher effectiveness, 

and offers such general observations 

and new hypotheses as appear to 

be supported by the data. 


WISCONSIN 
STUDIES OF THE 
MEASUREMENT 
AND PREDICTION 
OF 

TEACHER 
EFFECTIVENESS 


BY A. S. BARR 
with Worcester, Abell, 


Beecher, Jensen, Peronto, Ringness, 
and Schmid 


as printed in the September 1961 
Journal of Experimental Education 


| 


THE JOURNAL OF EDUCATIONAL RESEARCH 
(Volume 55, Number 6, March 1961) 


A Survey of Status and Trends in Departmentalization 


in City Elementary Schools 


THIS STUDY, made early in 1959, was planned 
to develop an efficient and economical procedure to 
gather reliable current data about selected prac- 
tices in city elementary schools. The primary pur- 
pose of such surveys would be to determine the im- 
pact, on elementary school practices, of current 
criticisms of and pressures on the schools. In ad- 
dition the data could help establish trend lines. 


Purposes of This Survey 


The main purpose of this survey was to deter- 
mine whether or not proposals for subject matter 
emphasis in science, mathematics and reading Were 
reinforcing or reversing trends in departmentaliza- 
tion in city elementary schools. 

Another important purpose was to collect data 
about the current status of practices in depart- 
mentalization. 


Definitions 


_ For this research, department alization was de- 
fined as “а method of school organization in which 
a teacher or supervisor instructs in one subject or 
in one group of related subjects. ” Remedial teach- 
ers who work with individuals or special small 
groups of children were not considered as depart- 
mental teachers within this meaning. 

Elementary school included any school enroll- 


* Based on portions of a doctoral project с 
New York, N.Y. The author wishes to e 


Elsbree and Dr. Irving Lorge for their assistance а. 


sults. 


ress his appreciation to Dr. 
БЕ nd advice in planning the research and analyzing the re- 


ROLAND E. BARNES 
Queens College 


ing pupils below the seventh grade that was desig- 
nated as anelementary school by the state. Respond- 
ents were instructed to report data for the first six 
grades only, where schools also enrolled pupils in 
higher grades. 


Procedure 


A questionnaire designed to obtain two types of 
information was used in this study. The first part 
of the questionnaire elicited information about school 
enrollment, number of full-time teachers, grade or- 
ganization and the like. The second part asked for 
information about departmentalization; whether or 
not it was practiced in the school, subjects and 
grades that were departmentalized and when, ando- 
pinions about trends in departmentalization in each 
school system. 

Questionnaires were senttoa proportional strat- 
ified sample of 1, 018 city elementary schools, s uí- 
ficient for а 4% permissible error at the 0, 99 confi- 
dence level. Proportional stratification was ensured 
by selecting every fifth city from fivecity population 
size groups used by the U. S. Office of Education. 1 
Individual schools in these cities were selected at 
random in required numbers from the latest avail- 
able city and state educational directories. 

Eight hundred six usable returns (79.2%) were 
received; the sampling error was plus or minus 3. 3% 
atthe 0.99 confidence level. Returns came from 


ompleted in July 1959, at Teachers College, Columbia University, 


Harold J. McNally, Dr. Willard S. 


United States, 1952-54, Chapter III, *Sta- 


1. і nnial Survey of Education in the à 
Miel Staff, Pupils, and Finances - 1953-54" ashington, D. C.: Government 


tistics of City School Systems: 
Printing Office, 1956), pp. 38-153. 


2. For fi l tary schools were not liste 
qe onal Education Assoc 


lected from names listed in: Nati 


d in their directories. Schools in these states were se- 
iation, Department of Elementary Principals, ‘‘Mem- 


bership Directory and Annual Report," The National Elementary Principal, 38: 51-194, November, 1958. 
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principals in cities in all 49 states. For no state 
was the proportion of returns less than 50% of those 
to whom questionnaires were sent. Only 11 states 
had a proportion of returns of less than 70%. 


Major Findings 


The findings reported below refer to depart- 
mentalization in grades one to six only; this study 
was limited to practices in the first six grades of 
elementary schools. 

Proportion of DepartmentalizedSchools--Of the 
806 schools reporting, 351, or 44%, indicated that 
one or more subjects were departmentalized. Group 
V cities (population 2, 500 to 4, 999) hada significant- 
ly larger proportion (59%) of departmentalized 
schools, Group П cities (population 25, 000 
to 99,999) had the smallest proportion, 38%. 

Principals’ Opinions About Trends--Five hun- 
dred Тоту-Нуе, or 68% of the 806 principals indi- 
cated they thought the practice of departmentalization 
in their school systems was neither increasing nor 
decreasing. A significantly smaller proportion (59%) 
inGroup I cities (population 100,000 orover) 
checked this category. 

Ninety-seven principals (12%) thought that the 
practice was decreasing. Larger proportions in 
Group I and II cities, the two largest population size 
groups, thought the practice was decreasing than 
thought it was increasing. 

Eighty-five principals (11%) thought depart- 
mentalization was increasing. Larger proportions 
of principals in Group III, IV and V cities (popula- 
tions 10, 000 to 24, 999; 5, 000 to 9, 999; and 2,500 to 
4,999, respectively) thought the practice wasin- 
creasing than thought it was decreasing. 

About 10% of the principals failed to answer this 
item. 

Date of Departmentalizing--For the 351 schools 
reporting one or m or e departmentalized subjects, 
54% of the subjects listed were departmentalized 
before the school year 1952-53. Group II cities had 
a significantly larger proportion (67%) and Group IV 
cities had a significantly smaller proportion (34%) 
of these long-standing departmentalized subjects. 

About 33% of all subjects listed were depart- 
mentalized between the years 1952-53 and 1956-57. 
Group I cities had a significantly larger proportion, 
45%, of subjects added during this period, while a 
Significantly smaller proportion (195) were added 
for Group II cities. 

About 14% of all subjects were departmentalized 
during this or last year, 1957-58 or 1958-59. Group 
III, IV and V cities had larger proportions of sub- 
jeets that were newly departmentalized during this 

period, with Group IV cities having a significantly 
larger proportion, 27%. The two largest population 
groups reported a smaller proportion of newly de- 
partmentalized subjects, with GroupI cities having 


a significantly smaller proportion, 5%. 

Departmentalized Subjects--The ten most fre- 
quently mentioned departmentalized subjects in the 
351 schools were, in rank order: 1. тивіс, 
2. physical education, 3. art, 4. arithmetic, 5, 
Science, 6. reading, 7. social studies, 8. library, 
9. English, 10. language arts. Altogether, 28 dif- 
ferent subjects were listed. Few subjects other than 
music, physical education andart were department- 
alized below grade four. 

The subjects most frequently listed as having 
been newly departmentalized during 1957-58 or 
1958-59 were, in rank order:1. music, 2. reading, 
3. physical education, 4. arithmetic, 5. science. 
Each of these subjects was departmentalized main- 
ly in grades four, five and six, or combinations of 
these grades. The subjects of reading, arithmetic 
and science accounted for more than half of all those 
that were departmentalized this or last year. 

Number of Departmentalized Subjects Per 
School--About 64% of the 351 schools reported three 
or less departmentalized subjects, Four to six such 
subjects were reported by 2875 of the schools and 
Seven or more subjects were reported by 8% of the 
Schools. GroupI cities had a significantly larger 
proportion of schools with more than five depart- 
mentalized subjects. 

Size of Departmentalized Schools--A bout 40% of 
the 351 schools employing departmentalization en- 
rolled over 600 pupils. Twenty-eight per cent en- 
rolled less than 400 pupils. The modal enrollment 
was 400-599 pupils. Group I cities hada significant- 
ly larger proportionof schools with enrollments over 
800, while Group III cities hada significantly larger 
proportion enrolling 200-399. 

The median enrollment for each population group 
was: Group I, 660; Group П, 525; GroupIII, 420; 
Group IV, 460 and Group V, 530. 


Summary and Conclusions 


The findings from this research seem to indicate 
that there is no change inthe practice of department- 
alization in the majority of city elementary schools. 
However, it appears that there is a definite counter 
trend toward increasing departmentalization in 
schools in smaller cities, especially during the past 
two years. Three subjects which have been singled 
out for special emphasis in some proposals consti- 
tuted more than half of all newly departmentalized 
Subjects, 

It is probable that schools in smaller cities are 
more responsive to the influence of criticisms and 
pressures, thereby reflecting more quickly than 
large city schools the effects of proposals for change. 
A similar survey planned to be made about three 
years hence might reveal more definite and positive 
change in what presently appears to be a counter 
trend in the practice of departmentalization. 
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Book Reviews 


We are always pleased to receive scientific monographs and scholarly publications 
based upon research. Send to Wilson Thiede, Professor of Education, University 


of Wisconsin. 


BOOKS, YOUNG PEOPLE, AND READING GUID- 
ANCE, Geneva В. Hannaand Mariana K. McAl- 
lister. New York: Harper and Brothers, 1960, 


219 pp. 


This book is especially appropriate as a supple- 
mentary text to accompany a basic children's liter- 
ature text. The authors have presented information 
and ideas which willgreatly aidteachers and librar- 
ians in facing problems of selecting books for chil- 
drenof junior and senior high school age. They 
present many facts concerning reading practices of 
young children. Some ofthese are given lucid treat- 
ment and may alarm some readers but learning and 
change accompany recognition of such factors. 

The chapter on Books and the Needs of Youth 
lists and suggests literature for reading to satisfy 
the authors’ concepts of young people’s needs. Elev- 
еп needs are given. Books are suggested that may 
provide the insight or spark for behavioral under- 
Standing on the part of young readers. 

. Perhaps the most assuring section of the book 
is found in the chapter on Librarians and Teachers, 
Books and Young People. The statements concern- 
ing techniques quick reading, book background for 
the adult and tools for selecting books are direct and 
Sensible. 

. Few books have narrowedthe fieldof the teach- 
ingof literature for the junior and senior high-school 
age. These authors have successfully done so. The 
Strong statements of specific tasks that should be 
performed by teachers, librarians, and parents in- 
terested in having young people read books places 
the book high on the practical side. А few chapters 
àre weakened by the attempt to organize young peo- 
ple’s needs and to list books that possibly satisfy 
these needs. Adults who wish to provide the best 
books to all readers would profit by reading the books 
Tecom mended by the authors and conveniently re- 


Peated at the back of the book. 
David C. Davis 


University of Wisconsin 


TELEVISION IN THE LIVES OF OUR CHILDREN, 
Wilbur Schramm, Jack Lyle, and Edwin B. 
Parker. Stanford, California; Stanford Univer- 
sity Press, 1961. 


Ubiquitous television comes under the micro- 
Scope, and to a degree before the telescope in this 
many-faceted study. Inthese days of claims and 
counter claims it is reassuring to find conclusions 
which are backed up by evidence. 

The authors have brushed aside prejudices and 
accepted conceptsto start anew. They have attempt- 
ed to put television into better perspective in the life 
of the American child and his environment. 

At the outset, the authors make it clear that they 
are concerned with what childrendo with television, 
rather than with what television does to children. 
This is the approach used in considering pertinent 
questions. 

This study is not merely aquantitative analysis 
of television consumptionby children, Rather, it is 
a qualitative and psychological appraisal of the re- 
actions of young viewers to the medium. It studies 
and compares habits in non-television and in view- 
ing areas. In general, it finds television viewing to 
be a time demanding activity whichresults in reduc- 
tions in movie going, radio listening, comic book 
reading, and group play activities. 

Psychologists, self-styled as well as profes- 
sional, will find much of interest in these studies. 
Through a rational approach the authors seek to de- 
termine the reasons why children select certain pro- 
grams and reject others. То what ends do they use 
the programs? Variables, Such as age, sex, men- 
talability, social norms, and social relationships, 
are associated with reactions observed in children. 

For example, it was found that when a child of 
above average intelligence has unsatisfactory rela- 
tionships with his family or friends he tends to re- 
treat to television for escape and to reduce tension. 
In comparison, children of lower intelligence, who 
normally are greater users of television, turn from 
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television in times of such conflict. They seek vi- 
olence, and remember it in real life when there is 
need to be aggressive. МА 

The “fantasy” needs of children seem to be 
served by certaintelevisionprograms. Such learn- 
ing as accrues is an added increment. Vocabulary 
increases result. Though television is a real aid to 
learning for very young children, it contributes little 
to reality experience or the kind of knowledge learned 
in school. 

Among older children the change from fantasy 
to reality is reflected in the smaller amount of time 
devoted to television and thegreater amount of time 
devoted to reading. In school, teen-agers who are 
heavy users of television do less well in knowledge 
tests than do light users. 

It is not possible to determinethat television is 
good or bad for children. There is always the re- 
lationship between a kind of television and a kind of 
child in a kind of situation. It is necessary to con- 
sider the whole life of the child as influenced, also, 
by family, friends, church, and school. An exten- 
sive appendix reports in detail many studies dealing 
with such factors. 

Challenging questions are posed to parents, 
Schools, government, researchers, and others con- 
cerned with television consumption. 

In summary, the authors say, ‘We are faced 
by a problem in cooperation and conservation. We 
have a resource in children, and a resource in tele- 
vision. Weare concerned that television should 
strengthen, not debilitate, the humanresource. This 
end can be accomplished most easily, most effec- 
tively, not by unilateral activity on the part of the 
television industry, or by parents, or schools, or 
churches, but rather by mobilizing allthe chief 
forces in society which bear on the television-child 
relationship. It must be a shared effort to meet a 
Shared responsibility. We must mobilize the talent 
and sense of public responsibility of broadcasters, 
the love and guidance and companionship of parents, 
guidance from schools, and yes, even the skills and 
interest of researchers. Tothe carrying out of this 
last responsibility, we hope this book will make 
slight contribution. ?? 

Television is an important element in the lives 
of children and it is here to stay. This book pro- 
vides guide posts to direct its use in proper paths. 

Harold A. Engel 
University of Wisconsin 


POLICIES FOR SCIENCE EDUCATION, Frederick 
L. Fitzpatrick (ed.,). New York: Bureau of 
Publications, Teachers College, Columbia Un- 
iversity, 1960, x 219 pp. 


A group of science educators associated with 
the Science Manpower Project have written this book 


for school administrators, supervisors, and the 
general public. The report is concerned with mat- 
ters of general policy and other factors related to 
the administration of science programs. 

Because the book was written for those desir- 
ing a general view of the problems of science educa- 
tion, a particular science teacher will not find in it 
help for any specific problems he might face. 

The authors have considered the reasons for 
teaching science at all levels and the responsibilities 
of policy makers and have offered suggestions for 
improving existing science programs. Throughout 
the book the need for a planned but flexible K-12 
Science program is stressed. 


The results of numerous surveys are used to 
describe existing science programs. The difficulties 
of elementary science teachers responsible for 
teaching numerous subjects are described. In addi- 
tion to lacking time for planning, these persons of- 
ten lack academic preparation inthe sciences. Other 
problems considered are the impact of expanding 
elementary science programs on the curriculums of 
the junior and senior high schools, the lack of prep- 
aration of many вес éndary science teachers, and 
the frequent lack of pubic support for more empha- 
sis on the teaching of science. 


To meet many of these difficulties the authors 
suggest the formulation of policies by science edu- 
cators and school administrators which would pro- 
vide for the in-service training of teachers, an or- 
ganized K-12 program, more teacher time for plan- 
ning and improved undergraduate courses for pros- 
pective teachers. 


The authors deal with these problems ina general 
way and do not mention many of the attempts which 
have been made to improve science instruction. 
Among these are the Biological Science Curriculum 
Study, the Chemical Bond Approach program, the 
Harvey White physics films and numerous experi- 
ments with team teaching in science. Because this 
book is one of aseries of publications by the Science 
Manpower Project the suggestions for science 
courses presented inthese other monographs are 
mentioned. 


Certainly if policies for the improvement of 
science instruction are to be formulated by educators 
and supported by the public they must be based on 
specific problems of the local school system and 
knowledge of the most desirable educational prac- 
tices. In this respect, the authors do not show how 
good policies come about. Perhaps a ‘‘case history" 
of a school system that brought about particular im- 
provements by the adoption of sound policies would 
provide the reader with a stronger stimulus to g0 
forth and recommend new policies in his situation. 

Richard E. Haney 
University of Wisconsin-Milwaukee 
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"It Says in the JER 


TENSIONS ARE OFTEN CREATED IN THE CLASSROOM when a teacher presents 
materials contrary to the beliefs of her pupils. A tèacher who attempts 


to modify attitudes by discussing tension-provoking topics should be skilled : 


in evaluating student tensions. Сап a teacher infact ‘‘measure’’ tensions? 
Yes, says Facen after studying the degree of tension produced inselected 
white students when presented with certain intormation about Negroes. 

She concludes that 1) judges can concur as to the degree and kind of ob- 
servable tensions produced, 2) there is no relation between attitude and 
tension, and 8) the g'is no relation between attitude and number of re- 
sponses. b 


IF YOU WANT TO BE THE DEAN OF A CHURCH-RELATED col- 
lege, you must be a statesman in relations with president, faculty, 
students, and public; you should be trained inthe intricacies of 
administration; and you ought to be prepared to work diligently at 
improving climate, facilities, and personnelfor productive schol- 
arship. But you won't be concerned with the religious tone of the 
campus, and you needn't be а nationally recognized scholar in 


your own right. So reports Dicks after sam pling opinions at 41 
Presbyterian colleges. 


ҹу 

WHEN CHILDREN OF LOW, average, and high intelligence receive learning tasks 
graded appropriately to their levels of achievement, they retain and trans- 
fer equally wellto new situations of appropriate difficulty. This finding 
by Klausmeier and Check is based on experimentation with human subjects 
in school settings, and runs counter to earlier studies based on laboratory 
research. The implication: ‘‘Classroom téachers should find out the present 
achievement level of each child and then offer individual or small-group in- 
struction. "' 


. . . Continued on page 306 
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THE NEED FOR the present study of the functions 
and qualifications of the academic dean of the Pres- 
byterian Church-related colleges is apparent from 
a review of the literature in the field of educational 
administration. A review of the literature reveals 
the following: the lack of definitive studies relative 
to the functions and qualifications of administrative 
officers as a whole; the lack of recent definitive 
studies of the denominational liberal arts colleges; 
_ the absence of comprehensive studies of the functions 
and qualifications of the academic dean. 

Thé questionnaire technique was selected for this 


sources in the area of questionnaire construction. 
The present study does not begin and end with this 
method but uses it, where it deems it appropriate, 
that is, in determining the current f uctions and 
qualifications of the academic dean. Other sources 
are used in ascertaining the causes of the p resent 
| conditions, and still other techniques are used in 
evaluating the findings and suggesting improvements. 
7 After careful study the writer selected the ‘YES- 
NO’ answer andthe concise one-line question in or- 
der to make the questionnaire as attractive as pos- 
Sible to the persons answering the question naire 
while atthe same time allowing for the greatest pre- 
Cision of tabulation of results. 

_ Inthe present study three:short questionnaires, 
in place of one long questionnaire, were constructed 
ав follows: Questionnaire I, consisted of thirty ques- 
tions, concerning the functions of academic dean, 
followed by a double column for checking yes or no 
to therightofthepage. The president, the academic 
dean, and three representative fac ulty members 
Íromeach college, were asked to answer each ques- 
tion without conferring with the other members of 
the college. Questionnaire I, as also II and III were 
limited to one page. 

Questionnaire П, tobe answered by the president 
and the academic dean only, was constructed much 
like the first questionnaire. However, there was no 
duplication of questions inany of the three question - 
naires, Questionnaire II, consisting of thirty-five 
co MES: was dividedinto two parts. The first part 
consisted of eighteen questions relative to the func- 


study following a careful review of the available . 


Тһе Functions and Qualifications of the Academic Dean in 
Presbyterian Church-Related Colleges 


CLAUDE D. DICKS 
St. Petersburg, Florida 


tions of the academic dean that it was not thought 
appropriate to include in Questionnaire I, since the 
faculty members would not be in a position to answer 
them. The second part of Questionnaire II related to 
qualifications of the academic dean consisted of sev- 
enteen questions. 

Questionnaire III, of the same general format as 
the first two questionnaires, though much shorter, 
consisted of fifteen questions. This questionnaire 
was also divided into two sections, the first section, 
on functions, consisted of five general questions rel- 
ative tothe ree to which the respondents thought 
that the academic dean used democratic procedure. 
The remainingten questions related to qualifications 
of the academic dean. Since some of the questions 
in Questionnaire III were personal, the dean was not 
asked to answer questionnaire III, the questionnaire 
being answered by the president of each college and 
three representative faculty members. As in Ques- 
tionnaires I and II, the third questionnaire was an- 
swered by checking yes or no. 

Eleven questionnaires, in one packet, were sent 
to each of the forty-one accredited Presbyterian 
Church-related colleges. The reason for sending the 
questionnaires notonly tothe deans but to the pres- 
idents and three representative faculty members 
was to makethe results more objective, more valid, 
and more reliable. 

Table I consists of a listing of the questions as 
they appeared in Questionnaire I, followed by the 
percentage of yes and no answers from all the col- 
leges participating, firstforthe deans, thenfor the 
colleges as a whole; that is, deans, presidents and 
representative faculty members. 


Suggested Improvements in the Functions of the 
niente Dean 


1. The dean of the college should function as key 
person to bring about a more democratic relation- 
ship between administration and faculty. 

2. The dean of the college should speed the proc- 
ess of shifting more and more of the traditionally 
technical duties of the deanship to special personnel 
trained for this particular phase of work. 
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TABLEI 


RESULTS OF QUESTIONNAIRE I 


Functions of the academic dean Deans Colleges 
Yes No Yes No 


With ref. to edu. activities does dean: 


1. initiate and actively guide curr. revision 92 08 83 17 
2. supervise quality of college instruction 88 12 76 24 
*. 3, supervise scheduling of classes 80 20 75 25 
4. supervise entrance tests 32 68 38 62 
5. supervise recommendations for scholarships 40 60 40 60 
6. process all petitions of students 72 28 66 34 
7. submit lists of students for graduation 41 59 47 53 
8. supervise all student personnel work 32 68 25 75 
9. supervise edu. and administrative research 60 40 59 41 
10. edit college catalog on academic matters 80 20 81 19 
With ref. to faculty does dean: 
11. execute policies adopted by faculty 100 00 96 04 
12. study and attempt to adjust fac. salary 60 40 57 43 
13. represent fac. in relations with public 88 12 72 28 
14. supervise all faculty committees 64 36 54 46 
15. encourage advanced study by faculty 100 00 88 12 
16. direct attention to latest edu. thought 96 04 85 15 
17. promote faculty-student relations 100 00 88 12 
18. promote fac. social and recreational life 60 40 49 51 
19. obtain position thru election by faculty 04 96 02 98 
With ref. to students does dean: 
20. supervise changes in program of courses 60 40 та 26 
21. supervise enforcement of atten. regulations 68 32 71 29 
22. supervise transfer students 52 48 58 42 
23. supervise all student discipline in college 52 48 58 42 
With ref. to administration does dean: 
24. preside at all faculty meetings 20 80 28 72 
25. prepare agenda for faculty meetings "1 29 71 29 
26. supervise preparation of edu. budget 42 58 38 62 
27. serve as member of admin. council 92 08 94 06 
28. report to president on work of college 75 25 83 17 
29. obtain position thru appointment by pres. 84 16 87 13 
30. set the religious tone of the college 13 87 23 77 
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TABLE Ia 


SUPPLEMENT 3 


Functions of the academic dean Deans Colleges 
Yes No Yes No 


Marked prevalence of functions listed: 


1. execute policies adopted by faculty 100 00 96 04 

. encourage advanced study by faculty 100 00 90 10 

3. promote faculty-student relations 100 00 88 12 
4. direct attention to latest educational thought 96 04 85 15 
5. initiate and actively guide curriculum revision 92 08 83 17 
6. serve as member of administration council 92 08 94 06 
7. supervise quality of college instruction 88 12 76 24 
8. represent faculty in its relation with public 88 12 72 28 
9. obtain position thru appointment-president 84 16 87 13 
10. supervise scheduling of classes 80 20 75 25 
11. edit college catalog on academic matters 80 20 81 19 
12. report to president on work of college 75 25 83 17 
13. process all petitions of students 72 28 66 34 
14. prepare agenda for faculty meetings 71 29 71 29 
15. supervise enforcement of attendance regulations 68 32 71 29 
16. supervise all faculty committees 64 36 54 46 

No marked prevalence or lack of functions: 
17. supervise educational and administrative research 60 40 59 41 
18. study and attempt to adjust faculty salaries 60 40 57 43 
19. promote faculty social and recreational life 60 40 49 51 
20. supervise changes in program of courses-studies 60 40 74 26 
21. supervise transfer students 52 48 58 42 
22. supervise all student discipline in college 52 48 50 50 
23. supervise preparation of educational budget 42 58 38 62 
24. submit lists of students for graduation 41 59 47 53 
Marked lack of functions listed: 

25. supervise recommendations for scholarships 40 60 40 60 
26. supervise entrance tests 32 68 38 62 
27. supervise all student personnel work 32 68 25 75 
28. preside at all faculty meetings 20 80 a 1, 
29. set the religious tone of the college dd bó a B 


30. obtain position thru election by faculty 
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TABLE II 


RESULTS OF QUESTIONNAIRE П 
a aii SS a D s астана REA T emi MA 


Functions and qualifications of academic dean Deans Colleges* 
Yes No Yes No 


Functions of the academic dean: 
With ref. to edu. activities does dean: 


1. preserve records of conduct and attend. 65 35 60 40 

2. interview students on all aca. matters 59 41 66 34 

3. give vocational guidance to students 48 52 44 56 

4. supervise placement of students 22 78 19 81 

5. sup. and administer loan funds to students 13 87 09 91 

6. correspond with parents regarding student welfare 87 13 88 12 

Ж т. supervise est. of student organizations 36 64 41 59 

With ref. to administration does dean: 

8. supervise clerical staff of the college 04 96 04 96 

9. supervise registration 74 26 70 30 

10. supervise admissions 52 48 49 51 
11. recommend persons to pres. for appointment 83 15 87 13 
12. recommend faculty members for promotion 83 17 87 13 
13. recommend faculty members for salary increases 78 22 83 17 
14. perform any functions independent of pres. 36 64 53 4 
15. hold additional administrative titles 32 68 30 70 
16, participate in formulation of college policy 100 00 100 00 
17. participate in laying future plans-college 96 04 98 02 
18. plan college assembly and chapel programs 30 70 26 74 


Qualifications of the academic dean: 
With ref. to academic training does dean: 


19. have a В.А. or B.S. degree 100 00 100 00 
20. have a B.D. degree 18 82 17 83 
21. have an M. A. or M.S. degree 95 05 98 02 
22. have an Ed. D. degree 14 86 16 84 
23. have a Ph.D. degree 70 30 71 29 
24. have aD.D. or other honorary degree 00 100 02 98 
25. have an academic major in administration 30 10 28 72 
26. have academic courses relative to deanship 35 65 22 78 
27. have academic training in higher education 57 43 59 41 
With ref. to general background does dean: 
28. attend administrative workshops 78 22 83 17 
29. have experience in administration 86 14 е 83 17 
30. have tenure exceeding five years 81 19 80 20 
31. attend professional meetings of administrators 100 00 100 00 
32. write articles in field of administration 22 "8 22 78 
33. have Presbyterian religious background 70 30 73 27 
34. have salary above highest faculty salary 89 11 88 12 
35. have caucasian background 96 04 96 04 


*Combined answers of deans and presidents 
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TABLE Па 
Supplement 2 
M ————————————————— 


Functions and qualifications of academic dean Deans Colleges 
Yes No Yes No 


Functions of academic dean: 
Marked prevalence of functions listed: 


1. participate in formulation of college policy 100 00 100 00 
2. participate in laying future plans 96 04 98 02 
3. correspond with parents regarding student welfare 87 13 88 12 
4. recommend persons to pres. for appointment 83 17 87 13 
5. recommend faculty members for promotion 83 17 87 13 
6. recommend faculty members for salary increase 78 22 83 17 
7. supervise registration 74 26 70 30 
8. preserve records of conduct and attendance 65 35 60 40 
No marked prevalence or lack of functions: 
9. interview students on all academic matters 59 41 66 34 
10. supervise admissions 52 48 49 51 
11. give vocational guidance to students 48 52 44 56 
Marked lack of functions listed: 
12. perform any functions independent of pres. 36 64 53 47 
13. supervise est. of student organizations 36 64 41 59 
14. hold additional administrative titles 32 68 30 10 
15. plan assembly and chapel programs 30 70 26 74 
16. supervise placement of students 22 18 19 81 
17. supervise and administer loan funds to students 13 87 09 91 
18. supervise clerical staff or college 04 96 04 96 


Qualifications of academic dean: ; 
Marked prevalence of qualifications listed: 


1. have а В.А. or B.S. degree 100 00 100 00 
2. attend professional meetings of administration 100 00 100 00 
3. have caucasian background 96 04 96 04 
4. have M. A. or M. S. degree 95 05 98 02 
5. have salary exceeding highest faculty salary 89 11 88 12 
6. have background of experience inadministration 86 14 83 17 
7. have tenure exceeding five years 81 19 80 20 
8. attend administrative workshops 78 22 83 17 
9. have Presbyterian religious background 70 30 73 27 
10. have a Ph.D. degree 70 30 71 29 
No marked prevalence of qualifications listed: 
1% didis ro training in higher education 57 43 59 41 
Mark alifications listed: 
nr tae she cr courses relative to deanship 35 65 22 78 
13. have academic major in administration 30 70 28 72 
14. write articles in field of administration n 5 n p 
15. have a B.D. degree {4 aa ij A 
16. have an Ed.D. degree SUC id i йр 


17. have а Б.В. or other honorary degree 
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TABLE III 


RESULTS OF QUESTIONNAIRE III 


Functions and qualifications of academic dean Presidents Colleges 
Yes No Yes No 


Functions of academic dean: 
With ref. to democratic procedure does dean: 


1. encourage use of free intelligence 100 00 99 01 
2. encourage participation by parties concerned 100 00 97 03 
3. encourage useofind. differences of background 100 00 96 04 
4. encourage use of free cooperation 100 00 96 04 
5. act as guide in academic affairs of college 100 00 92 08 


Qualifications of academic dean: 
With ref. to general qualifications is dean: 


6. a capable and astute parlimentarian 87 13 80 20 
7. able to instill confidence in staff 96 04 88 12 
8. close working relationship with president 100 00 98 02 
9. a shrewd judge of people 82 18 82 18 
10. careful in statements at all times 100 00 86 14 
11. a person of unquestioned integrity 100 00 97 03 
12. an effective administrator 88 12 86 14 
13. an eminent scholar known nationally 16 84 20 80 
14. able to harmonize personalities of faculty 84 16 77 23 
15. male 88 12 88 12 


*The deans were not asked to answer the third questionnaire so the first double column presents 
the answers of the presidents. The columns under “СоПерев” indicate the responses of the 
presidents and representative faculty members from the various colleges. 


Table Ш, supplement 2, presents the questionnaire statements with responses in decreasing 
percentage and prevalence, 


TABLE Ша 
SUPPLEMENT 2 


че ———————————————————————————— 


Functions an qualifications of academic dean Presidents Colleges 
Yes No Yes No 


Functions of academic dean: 
Макен prevalence of functions listed: 


encourage use of free intelligence 100 00 99 01 
2. encourage participation by parties concerned 100 00 97 03 
3. encourage use of individual differences of back- 

ground 100 00 96 04 
4. encourage use of free cooperation 100 00 96 04 
5. act as guide in academic affairs of college 100 00 92 08 


Qualifications of academic dean: 
Marked prevalence of qualifications listed: 


1. іп сіове working relationship with president 100 00 98 02 
2. a person of unquestioned integrity 100 00 г 97 03 
3. careful in statements at all times 100 00 86 14 
4. able to instill confidence in teaching staff 96 04 88 12 
5. an effective administrator 88 12 86 14 
6. male 88 12 88. 12 
7. a capable and astute parliamentarian 87 13 80 20 
8. able to harmonize personalities of faculty 84 16 77 23 
9. а shrewd judge of people 82 18 82 18 


Marked lact of qualifications listed: 
10. an eminent scholar known nationally 16 84 20 80 


(The present study is written upina 180 page report including extensive evaluation of findings 
and a comparison with the Methodist and Baptist church-related colleges. This study began 
in 1956) 
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TABLE IV 


MASTER TABLE COMBINING THE RESULTS OF TABLES I, П, AND III SHOWING DECREASING PER- 
CENTAGE OF PREVALENCE OF THE FUNCTIONS AND QUALIFICATIONS OF THE ACADEMIC DEAN 
INDICATED IN THE EIGHTY-SIX PERCENT OF THE PRESBYTERIAN CHURCH-RELATED 
COLLEGES PARTICIPATING IN THE PRESENT STUDY 


SSS 


Functions and qualifications of academic dean Dean President Faculty College 


Nene oer Ды Ді a ee Uc ange momen m e WEIT 


А. Functions: 


jd 


П. 


Marked prevalence of functions listed 


1. execute policies adopted by faculty 100 96 95 96 
2. encourage advanced study by faculty 100 100 82 90 
3. promote faculty-student relations 100 95 79 88 
4. participate in formulating college policy 100 100 100 100 
5. encourage use of free intelligence 100 100 100 
6. encourage participation by all concerned 100 96 98 
7. encourage use of individual differences of background 100 95 97 
8. encourage use of free cooperation 100 95 97 
9. act as guide in academic affairs of college 100 89 92 
10. direct attention to latest educational thought 96 100 74 85 
11. participate in laying future plans for college 96 100 98 
12. initiate and actively guide curriculum revision 92 92 76 83 
13. serve as member of administrative council 92 92 95 94 
14. supervise quality of college instruction 88 88 67 76 
15. represent faculty in relations with public 88 78 62 72 
16. correspond with parents regarding students M 55 A 5 


17. obtain position through appointment 


18. recommend persons to president for appointment 83 90 87 
19. recommend faculty members for promotion 83 90 87 
20. supervise scheduling of classes 80 87 69 15 
21. edit college catalog on all academic matters 80 83 81 81 
22. recommend faculty for salary increases 18 87 83 
23. report to president on work of college 75 83 88 83 
24. supervise registration қ 74 67 70 
25. process all petitions by students 72 56 68 66 
26. prepare agenda for faculty meetings 71 79 70 11 
27. supervise enforcement of attendance regulations 68 79 10 11 
28. preserve records of conduct and attendance 65 54 60 
29. supervise all faculty committees 64 64 47 54 
No marked prevalence ог lack of functions: 

30. pae % educational and administrative research 60 64 58 59 
31. study and attempt to adjust faculty salaries 60 52 58 59 
32. promote faculty social and recreational life 60 67 39 49 
33. supervise changes іп program of courses for students 60 83 15 74 
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TABLE IV (Continued) 
Deans 
34. interview students on all academic matters 59 
35. supervise transfer students 52 
36. supervise all student discipline in college 52 
37. supervise admissions 52 
38. give vocational guidance to students 48 
39. supervise preparation of educational budget 42 
40. submit lists of students for graduation 41 
III. Marked lack of prevalence of functions: 
41. supervise recommendations for scholarships 40 
42. perform any functions independent of president 36 
43. supervise establishment of student organization 36 
44, supervise entrance tests 32 
45. supervise all student personnel work 32 
46. hold additional administrative titles 32 
47. plan college assembly and chapel programs 30 
48. supervise placement of students 22 
49. preside at all faculty meetings 20 
50. set religious tone of college 13 
51. supervise and administer loan funds to students 13 
52. supervise clerical staff of college 04 
53. obtain position thru election by faculty 04 
3. Qualifications; 
I. Marked prevalence of qualifications listed: 
1. have а B. A. or B. S. degree 100 
2. attend professional meeting of administration 100 
3. in close working relationship with president 
4. person of unquestioned integrity 
5. careful in statement at all times 
6. have caucasian background 96 
7. able to instill confidence in faculty 
8. һауе M. A. or M. S. degree 95 
9. have salary exceeding highest faculty salary 89 
10. an effective administrator 
11. male 
12. capable and astute parliamentarian 
13. have background of experience in administration 86 
14. able to harmonize personalities of faculty 
15. a shrewd judge of people 
16. have tenure exceeding five years 81 
17. attend administrative workshops 78 
18. have Presbyterian religious background 70 
19. һауе a Ph.D. degree 10 
II. No marked prevalence or lack of qualifications: 
20. have academic training in higher education, etc. 57 


President 


Faculty 


College 


3. The dean of the college should function asa 
statesman in human relations, developing and mold- 
ing the personalities of students and faculty. 

4. The dean of the college should assume the re- 
sponsibility of informing the faculty of its meaningful 
place in the democratic structure and operation of 
the college. 

5. The dean of the college should play alarger 
part in initiating and guiding educational and admin - 
istrative research. 

6. The dean of the college should participate to a 
greater degree inthe preparation of the educational 
budget. 

7. The dean of the college should be selectedin 
such a manner that the faculty participate in the 
selection. 


Suggested Improvements inthe Qualifications of the 
Academic Dean 


1. The dean of the college, as academic dean, 
Should have a Ph.D. degree or its equivalent. 
2. The dean of the college, because of his need for 
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а broader background of understanding in education 
than the average faculty member, should have ad- 
vanced academic training in the field of higher edu- 
cation. 

3. The dean of the college, as administrator, 
should have in his academic background courses rel- 
ative to the deanship. 

4. The deanof the college should have an academic 
major inadministration,as well as advanced training 
in a content field. 

5. The dean of thecollege should contribute to the 
improvement of his office by writing definitive ar- 
ticles in the field of administration. 

6. The dean of the college should contribute to а 


‚ higher degree in the academic achievements of the 


college. 


Note: Theimprovements suggested above are based 
on questionnaires forwarded to the Presbyterian 
church-related colleges in the light of relative stud- 
ies and comments in the field of administration in 


Higher Education. 
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It Says in THE JER (continued)... . 


Groff has found that a c hild's attitude toward what he reads will 
have a significant effect on how well he learns to read, particu- 
larly on a test of criticalthinking. Differences between boys and 
girls were consistent throughout the study. 


FACTORS OTHER THAN THOSE MEASURED BY CONVENTIONAL scholastic tests 


are of critical importance to student achievement in an adult high school, 
Crouch reports. She found no correlation between grades and amount of pre- 
vious education, no correlation between grades and scholastic ability among 
16-18 year olds, and only a low level of correlation between grades and scho- 
lastic ability among 19-48 year olds. Within sub-groups there were no sig- 
nificant differences between single and married students, and veterans and 
non-veterans. Negroes tended to rate below whites in both aptitude and 
achievement. Тһе study suggests that motivation is а key variable in adult 
education. 


Halliwell presents a concrete illustration of THE DANGER IN- 
HERENT IN CORRELATING AVERAGES, particularly in studies 
involving teachers’ marks. 


CAN A STUDENT SUCCESSFULLY ':G-2" HIS INSTRUCTOR? Yes, says Winter. 


In two California studies, grades received in psychology courses were relat- 
ed to similarity in values between student and instructor and also to student’s 
ability to predict his instructor’s values correctly. Inobtaininga measure 

of the value dimension, Winter was only partially successful in substituting 

a quick-ranking procedure for the more laborious semantic differential. 


America’s schools are having a hardtime teaching SKILL IN 
CRITICAL THINKING. **Why?" Fox asked 143 California senior 
high social studies teachers. Over 25% of the teachers blamed 
“student lethargy;’’ 22% said pupils come tothem with insufficient 
experience in analyzing, interpreting, and evaluating information; 
10% complained about ‘‘lack of time” to do the job. One pos- 
sible bright note: recent graduates of teacher-education insti- 
tutions reported better success than did veteran teachers. 


ROCKY BEZZI REPORTS ON a new standardized manuscript scale for grades 1, 2, 


and 3 with which ‘‘teacher, pupil, and administrator may evaluate with a high 
degree of accuracy both rate and quality of manuscript handwriting. ?? 
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Determination of the Degree of Tension Produced in 
White Students when Presented with Certain 


Materials Regarding Negroes 


А CONSIDERABLE amount of literature has been 
written concerning the status of the American Ne- 
gro, attitudes toward him and some of the probable 
causes of these attitudes, as well as classification 
ofthem. Less has been written concerning class- 
room techniques for dealing with tensions created 
when instructional materials pertaining to Negroes 
are presented. 

Often, when teachers attempt to present mater- 
ials which are contrary tothe beliefs of their pupils, 
or when attempts are made to change the beliefs, 
tensions are created. 

Tensions produce observable reactions which 
make it possible for teachers to be aware of the ef- 
fects of presenting controversial instructional ma- 
terials. Teachers who attempt to modify attitudes 
by discussing tension-provoking topics should be 
Skilled in the techniques of observing and dealing 
with tensions resulting from such presentations. If 
they can make evaluations of the tensions being pro- 
duced, then methods of teaching designed to control 
ch utilize the accompanying tension may be devel- 
Oped. 

The problem of this study is todetermine the de- 
gree of tensions produced in selected white students 
when presented with certain beliefs and factual ma- 
terials pertaining to Negroes. 

The problem suggested three hypotheses which 
formed the basis for the investigation: 

1. Judges can concur as to the degree and kind 
of observable tensions produced in selected white 
Subjects. 

Ж 2. There is no relation between attitude and ten- 
ions. 

3. There is no relation between attitude and 
number of responses during a discussion ре 
to Negroes. 

This study was confined tothree junior-s enior, 
ànd two freshman classes from the College of Edu- 
cation of a southern university (hereafter designated 
Group I, Group П, Group Ш, Group IV, Group V). 
With five exceptions, their vocational aspirat ions 
Were in the teaching field. They were asked to in- 


ИЛЕ ---- 
*Footnotes will be found at the end of the article. 


GENEVA FACEN 
Grambling College 


dicate their reaction tosixty-four items on the Pur- 
due Scale! to measure attitudes toward Anglo Amer- 
icans, Latin Americans, and Negroes. * 


Definition oi Terms 


As used in this study, the following definitions 
will apply to various terms: 

Favorable attitude; Subjects scoring 8.0 or above 
on attitude scale. 

Indifferent attitude: Subjects scoring 6.0 on atti- 
tude scale. 

Unfavorable attitude: Subjects scoring 4.0 or be- 
low on attitude scale. 

Attitudes: Attitudes, like other emotional activi- 
ties, may be incomplete, inconsistent, and tend to 
be highly non-specific. Originally, psychologists 
defined an attitude simply as a tendency to seek or 
avoid something. Their early studies of liking and 
disliking, approval and disapproval, were helpful 
though incomplete. We need to go deeper to under- 
stand the reasons behind such responses; therefore 
we have turned to more comprehensive definitions 
or attitudes. 

“Ап attitude is defined as the meanings thatone 
associates with a certain object (or idea) and which 
influence acceptance of it.’’ 

An element of acceptance or avoidance is present 
in any attitude, but additional as so ciations are in- 
volved. One study of attitude as a basis for behav- 
ior is Yates’ investigation of freedomof speech. 
She found “pupils favorable to free speech in prin- 
ciple but not at all clear as to whether one was free 
to speak damaging truth about ,another, or free to 
speak the truth maliciously.” 

As defined by Boring and others, ‘‘An emotional 
response to a situation describes an attitude, a pat- 
tern of conduct in accordwith mental responses to 
the situation. ” 4 

The Dictionary of Psychology defines attitude as 
“е specific mental dispositiontoward an incoming 
(or arising) experience, whereby the experience is 
modified, or, a condition of readiness for a certain 
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type of activity. '' 9 

An extension of the concept of readiness with а 
directive influence is the definitiongiven by Allport: 
“Ап attitude is a mental and neural state of readi- 
ness, organized through experience, existing asa 
directive response to all objects and situations to 
which it is related. ” 6 

Sherif and Cantril also see an attitude as an es- 
tablished readiness. They view this established 

readiness ав: “А subject-object relationship of 
highly variable content, which is learned (formed), 
which has affective properties with various degrees 
of motivational components, which may refer to 
whatever stimuli are encompassed in the subject- 
object relationship, and which deter mines that an 
individual will react to a stimulus ina selective way. 
Once formed, an attitude serves as an anchorage to 
Structure or modify subsequent experience or re- 
sponse. ” 

Thurstone concludes, **Attitude is the sum total 
of man's inclinations and feelings, motives, ideas, 
fears and convictions about any specific topic. »8 

Allport aptly summarizes: “ Attitudes cannot be 
classified satisfactorily, because human personality 
in which attitudes are embedded is an infinitely var- 
iable and constantly changing structure. There are 
no absolute barriers between one attitude and others. 
Attitudes never issue singly into action; they are 
always intensive when they are active.... We know 
they exist only by inference. It is the meaningful 
resemblance between activities and their congruence 
with one another that leads psychologists inescap- 
ably to postulate such generalized forms of readi- 
ness as the term ‘‘attitude’’ denotes. These forms 
of readiness oues not be thought of in abs olutistic, 
single terms. ” 

The operational definition of an attitude employed 
in this study was derived from scores computed on 
the attitude scale administered to all subjects. This 
was based on the assumption that the reactions re- 
flected a mental ‘‘set’’? which directed the individ- 
uals’ responses. This mental ‘‘set’’ is also assum- 
ed to be the result of a pattern of conduct in accord 
with each subject’s experiences, background, per- 
sonality and beliefs which structured his responses 
on the Purdue Scale. 

Degree of prejudice: The degree to which scores 
varied from the indifference value of 6.0 on the Pur- 
due Attitude Scale employed in this study. 

Prejudice: Prejudice consists inalarge measure 
of a person’s emotional attitude, predispositions, 
and the character of those highly energizedforms of 
activity when one encounters frustrations, insecur- 
ity and emergency situations. 

Kingsley and Gray’s study on emotional activity 
concludes that ‘‘extensive differences between var- 

ious individuals are to be found with respect to their 
propensities for fear, anger, jealousy, joy and oth- 
er forms of emotional upset. ’’ 0 

Since prejudice is ап emotional event, a compli- 
cated process, a curious mixture of feelings, and 


*mysterious-emotions' analysis must be made 
piecemeal. 

Warren defines prejudice as an attitude, ‘‘either 
for or against a certain unproved hypothesis which 
prevents one from evaluating new evidence correct- 
ly. Prejudice carries an emotional implication; the 
synonym bias strictly lacks this element. 11 

Radke, Trayer, and Davis see prejudice asa 
**contagion, ” being passed through society asa 
communicable virus. ‘‘Prejudices develop out of 
adult values and the status quo to which children are 
exposed as well as out of situations generating in- 
securities and frustrations. "12 

The very nature of this experiment links preju- 
dice with an emotional event. Subjects are more 
inclined to be “іог” or **against", therefore the 
working definition employed іп this study carries an 
emotional implication. 7 

Tension: Tension, as employed in this study, is 
defined as the emotional reactions selected from 
the literature and those observed by the judges dur- 
ing the investigation. 

Subjects exhibit differences in the degree of their 
tension reactions, the range and nature of the con- 
ditions that arouse emotions, and the manner in 
which they react when confronted with tension-pro- 
voking discussions. Probably there are as many 
variations and degrees of emotional reactions as 
there are different ways of perceiving the emotional 
situations. 


The Method 


Scores on the Purdue Scale, computed for atti- 
tude toward Negroes, formed the basis for the se- 
lection of the subjects to be observed. The subjects 
were unaware that they had been selected as sub- 
jects or that an investigation was being conducted. 

Those selected were determined by judges who 
based their decisions on the students' scores on the 
Purdue Scale. High scores represent favorable at- 
titude toward the race being measured, and low 
Scores unfavorable. 

Before the presentation, each judge was given а 
copy of the resource unit ‘Тһе American Negro"? 
from which he prepared his discussion, a copy of 
the seating and tension chart, and some instructions 
as to its use. 

For purposes of this experiment, eighteen ob- 
servable tension reactions were under observation. 
It would have been impossible to observe the total 
emotional event, but three major aspects were ta- 
ken into account. These were skeletal or overtbe- 
havior, visceral behavior, and emotional experi- 
ence. 

The skeletal behavior consisted of the kind of ac- 
tivity produced directly following the statements 
made by a speaker. Such activity was readily vis- 
ible to the observers, and was usually characteriz- 
ed by abrupt changes in Skeletal positions. This 
was the rationale for including the following items: 


Neck muscle activity 
Forearm muscle activity 
Leg muscle activity 
Gritting of teeth 

Clinching of fists 

Arching of back 

Nodding head (agreement) 

. Shaking head (disagreement) 

This behavior often, though not always, served 
to reveal to the observers the nature of the emotion 
which the subject exhibited. 

Visceral behavior, because of its internal na- 
ture, is not easily and readilyobservable. Some of 
the observable effects included in this study are: 
Blink rate - change in the rate 
Diction - including tone of voice 
Wrinkling of brow 
Sneering 

5. Perspiration 

It seemed appropriate for the purposes of this 
study to consider the total emotional event as ten- 
sion in terms of the reactions observable since this 
behavior was a function of the experiment, deliber- 
ately activi ted by the topics being discussed per- 
taining to Negroes. Therefore, three observations 
were based on the visible indicators of tensions re- 
sulting or accompanying the total emotional experi- 
ence. 

1. Weeping - tears - quivering of throat muscles 

2. Derisive laughter 

8. Squirming - obvious discomfort 

This approach utilized these major aspects of 
tension which made observations possible. They 
were based on the emotional behavior (consisting of 
the bodily reactions), both overt and visceral and 
the total emotional experience (made up of sensa- 
tions, feelings, personality and perc eptual mean- 
ings) brought to the situation by the attitude held by 
each subject. 

The degree of tension was measured by а scale 
value assigned on the basis of number of reactions 
recorded by the judges. The unit of measurement 
was based on number of tension reactions recorded 
during control and experimental periods. 

The hazards involved in diagnosing reactions 
from outward manifestations were reduced by not 
disclosing to the judges the attitude toward Negroes 
held by the subjects under observation. 

This research design provided an opportunity for 
Studying tension reactions of the subjects according 
to the degree of prejudice and predispositions al- 
Teady measured by attitude test scores. 
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The Treatment of Data 


The measure of tension evidenced by the individ- 
ual during the experiment (a subjective measur e), 
was related to the objective measure (attitude to- 
ward the Negro), as recorded on the Purdue Scale. 

_ The discrepancies between objective and subjec- 
tive data were checked by establishing the reliabil- 
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ity coefficients between and among the observations 
recorded by the judges as to the degree of tension 
shown by certain subjects and tested at the five per- 
cent confidence level. Inasmuch as each group of 
judges observed the same number of pupils with 
varying degrees of prejudice, the mean observa- 
tions and standard deviations were computed for 
each group. 

The standard deviations of Groups II, III, and IV 
are larger than those of Groups I and V, indicating 
that the former groups havea higher degree of var- 
iability. 

Coefficients of correlation were computed to de- 
termine the degree of relations between attitude and 
tension reactions, and attitude and verbal respon- 
ses during a discussion pertaining to Negroes. 

The coefficients of correlation would seem to in- 
dicate that positive relations bet ween attitude and 
verbal responses exist among subjects in GroupI 
(.40), while Groups П (-68), Ш (-.38) IV  (-.88) 
and V (-.36) show negative coefficients. А віші- 
lar pattern emerged when determining the relation 
between attitude and tension reactions during the 
discussion. Coefficients of .29, -.13, -.14, -.68, 
and -.20 are reported respectively for each of the 
five groups. 

However, when scores from each of the five 
groups were placed in rank order on the basis of 
attitude toward the Negro (without the division of 
groupings), low, but negative coefficients of corre- 
lation were found to exist among these factors for 
all subjects. A correlation of -.16 was f ound to 
exist between attitude and number of responses 
made by subjects during the discussion. Acorre- 
lation of -.05 was found to exist between attitude 
and tension reactions of all students. These values 
are not significant at either the five or one percent 
levels of confidence. 

These low negative coefficients indicate that 
there is no relationship between attitude and tension, 
as measured in this investigation, and attitude, as 
measured by the Purdue Scale, and number of re- 
sponses made during a discussion pertaining to Ne- 
groes. 

Although the coefficients computed were negative, 
and were not statistically significant at the five or 
one percent level, there appeared to be substantial 
variation among groups holding different degrees of 
prejudice toward Negroes. In order to establish 
whether subjects holding favorable attitudes (Group 
A), neutral attitudes (Group B), andunfavorable at- 
titudes (Group C) toward Negroes differed signifi- 
cantly in their number of responses during the dis- 
cussion and degree of tension reactions. The fig- 
ures shown in Tables I, П, and III were computed 
in order to apply analysis to these data. 

The value of 182. 2 was computed from Table III, 
for number of responses by group during the dis- 
cussion. This value is significant at the one per- 
cent level of confidence as determined by the tables 
compiled by Fisher and Yates. 
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TABLEI 


ANALYSIS OF VARIANCE OF RESPONSES DURING DISCUSSION OF 
GROUPS HOLDING UNFAVORABLE, NEUTRAL AND 
FAVORABLE ATTITUDES TOWARD NEGROES 


pn ынан 


Sources of Variation Sum of Squares dt Mean Squares 
Between Groups 54. 2798 2 27.1399 
Within Groups 14.4502 97 .1490 
Total 68.7200 99 

TABLE II 


ANALYSIS OF VARIANCE OF TENSION SCORES OF GROUPS HOLDING 
UNFAVORABLE, NEUTRAL AND FAVORABLE 
ATTITUDES TOWARD NEGROES 


Sources of Variation Sum of Squares df Mean Squares 

Between Groups 39. 6754 2 19. 8377 

Within Groups 10. 2458 97 . 1057 
TABLE III 


RELIABILITY OF DIFFERENCE AMONG GROUPS 


F Significance of F 
Analysis of variance of tension scores by groups holding un- Significant at 
favorable, neutral, and favorable attitudes toward the Negro 187.6 195 level 
Analysis of variance of responses during discussion by groups Significant at 
holding unfavorable, neutral, and favorable attitudes toward 182.2 1% level 
the Negro 
prac Менын када на ГТ es Soc cP Ya саса XP nuu s. so 
TABLE IV 
SUMMARY DATA 


Group No. Attitude Toward No. of Responses Measure of Tension 
Negroes During Discussion 
= тенине нан CHUTE a RUE На CERTUM CMM ISO TN 
A 20 X 60.1 ү 27 Y 12 
X 3.0 Y 1.35 Y... 06 
d$ 24. 5500 d$ 26.1880 
B 7 X 36.7 Xd Y 16 
xX 5.2 Y. 1.57 Y 2.29 
d$ 1.6471 48 3.4051 
с 23 X205.5 Y 14 Y 14 
X 8.5 Y 5l Y .61 
dS 13. 4783 ds 24.0867 


This significance indicates that there is some 
difference. By inspection it probably lies between 
Group A and Group C, as shown in Table IV. 

Similarly, the F value of 187.6 was computedfor 
tension scores on each group. It is also significant 
at the one percent level of confidence. This differ- 
ence probably lies again between Group A and Group 
C. The reliability of these differences at the one 
percent level of confidence is shown in order of 
their significance in Table Ш. 


Conclusions 


The major thesis of this study was that tensions 
would be created during a discussion pe rtaining to 
Negroes. This assumption was based on the idea 
that different attitudes toward the Negro would pro- 
duce varying degrees of tension, which could be ob- 
jectively measured. 

Observable tension was produced in white sub- 
jects when presented with certain unfounded beliefs 
and with factual materials pertaining to Negroes. 

When contrasting observations were tabulatedfor 
reactions during the period when the discussion per- 
tained to Negroes against the reactions when the 
topic was a regular classroom dis cussion, each 
judge expressed the opinionthat there was a definite 
rise in tension. This was further supported by a 
substantial increase in types and frequency of cer- 
tain tension reactions as compared to previous re- 
actions of subjects. 

Hypothesis I. Judges can conc ur as to the de- 
gree and kind of observable tensions produced in 
subjects. Two hundred thirty-five observations 
were recorded by the judges. Eighty perc ent of 
these observations were confirmed by two of three 
judges. 

Reliability of ratings based on judges’ records 
of tension reactions was .63, .70, .51, .44, and 
.15. Reliability of average ratings for three judges 
observing each group was .84, .86, .76, .90, 70 
and .90. When these coefficients were tested for 
reliability, they were well within the one and five 
percent confidence limits. 

Hypotheses П and Ш. There is no relation be- 
tween attitude and number of responses, and atti- 
tude and tension during a discussion pertaining to 
Negroes. 

Negative coefficients of correlations of -. 16 and 
-.05 were obtained for both of these relations when 
determined for 50 subjects. Positive though low 
correlations were found to exist in Group I between 
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attitude and response— .40, andattitude and tension 
reaction, .29. Each of the four remaining groups 
had low, but negative, coefficients of correlation. 
At the one and five percent levels of confidence, 
these values are not significant, hence the hypothe- 
ses were accepted. 


FOOTNOTES 


1. The Purdue Scale of Measuring Attitudes to- 
ward Races was developedbyH. Н. Grice and 
edited by H. H. Remmers, published by Pur- 
due University Press. 

2. Lee J. Cronbach, Educational Psycholo Е Y 
New York: Harcourt Brace and Company, 5 
p. 326. 

3. Ida Yates, “Сопсерів and Attitudes Concerning 
Slander and Freedom of Speech," Journal of 
Educational Research, 27 (1933), рр. 283-97. 

4. Edward Boring, Herbert Langfeld and Harry 
Weld, Introduction to Psychology, New York: 


John Wiley and Sons, , p. 39. 
5. H. C. Warren (Ed.), Dictionary of Psychology, 
Boston: Houghton Mifflin Company, ‚р. 24. 


6. Gordon W. Allport, *Attitudes," Handbook of 
Social Psychology (Ed., Carl Murchison) 
Worchester, Mass.: Clark University Press, 
1935, p. 810. 

7. M. Sherif and Н. Cantril, The Psychology of 
Ego-Involvements: (Social Ее апі Ydon- 
tifications), New York: John Wiley and Sons, 


Inc., 1947, p. 5. 
8. L. L. Thurstone, *'Attitudes Can Be Measur- 


ед,” American Journal of Sociology, 1928, 
Vol. 33, p. 547. ey, 


9. Allport, Op. Cit., p. 836. 

10. H. L. Kingsley and R. Gray, The Nature and 
Conditions of Learning, New York: Prentice 
Hall, Inc., 1957, p. 437. 

11. Warren, Op. Cit., p. 208. 

12. H. G. Radke, H. Trayer, and H. Davis, ''So- 
cial Perceptives and Attitudes of Children, ’’ 


Genetic Psychology Monographs, 40, ( Novem- 
ber, 1940), p. 330 


13. “Тһе American Negro”’ isa resource unit from 
which the materials were secured for presen- 
tation. It consists of historical data, contri- 
butions made by famous Negroes, beliefs and 
fallacies held by many concerning Negroes, 
accompanied by factual information. 

14. R. A. Fisher and F. Yates, Statistical Tables, 
New York: Hafner Publishing Company, 1953. 


B Field News 


HOWARD F. HJELM OF THE DIVISION OF EDUCA TION OF THE PAN AMERICAN 
UNION announces that the educational research unit currently is conducting the 
following investigations on education in twenty Latin American republics: 


1. A study of the structure of supervision and the functions of the supervisors 
іп the primary schools. 

2. A study of the primary school textbook situation and the preparation of a 
manual of guidelines for the preparation, publication, and selection of pri- 
mary school textbooks. 

3. A study of the recommendations on primary education made at international 

meetings since 1934 and of the trends in primary education covering the same 

period of time. Тһе variables to be considered in the trend study will be 
those emphasized in the recommendations. 

A study of the curricula in the normal schools. 

- A comparative study of the educational data collected in the 1950 and 1960 

censuses. 
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THE INSTITUTE FOR PERSONALITY AND ABILITY TESTS periodically publishes an 
information bulletin which should be of interest to many educational research 
workers. Some of the IPAT information bulletins available are: 

1. The Anxiety Factor in the IPAT Anxiety Scale: Recently Received Criterion 
Data. 

2. Predicting School Achievement from Personality, as Measured by the 
““Н5РО” and “16 РЕ” Tests. 

3. Recent Occupational Data on the 16 PF Questionnaire, 

4. Background and News on Culture Fair Intelligence Tests. 

Further information about these studies may be obtained by writing to the Insti- 

tute at 1602-04 Coronado Drive, Champaign, Illinois. 
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THE UNIVERSITY COUNCIL FOR EDUCATIONAL ADMINISTRATION announces that the 
research project, “Structure and Process of School-Community Relations, ” is 
now in its second phase. The first phase of the project, which ended in June 
1960, investigated community understanding as a factor in the financial support 


of public education. Community decision on school bond issues served as a 
focus for the study. 
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VIRGIL E. HERRICK, CHAIRMAN OF THE HANDWRITING FOUNDATION, announces 
that the Foundation plans to make available at nominal cost a report on a con- 
ference on research on handwriting which was held at the University of Wiscon- 
Sin on October 12, 1961. At least these major presentations will be contained 
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Children's Attitudes Toward Reading and Their Critical 
Reading Abilities in Four Content-Type Materials 


THE PRINCIPAL hypothesis of this study was that 
а positive relationship exists between fifth and sixth 
grade children's expressed attitudes toward four 
different content types of reading material and their 
Scores on an experimental test of critical reading. 
In addition, relationships between these critical 
reading scores and (1) attitudes expressed toward 
reading as a school activity and (2) attitudes express- 
ed toward school, classmates and teacher were ex- 
plored. Also examined were the relationships be- 
tween attitudes expressed toward the four different 
content types of reading materials and 1) attitudes 
expressed toward reading as a school activity and 
2) attitudes expressed toward school, classmates 
and teacher. The relationships between personal 
factors--sex, socioeconomic status, chronological 
age, general reading ability and intelligence--and 
the above variables were also determined. 

Subjects: The subjectsofthis study were 305 fifth 
andsixth grade children (163 girls and 142 boys). 
Asagroup they were nearly “ауегаве” as evidenced 
from data obtained as to their chronological age, 
their intelligence | Kuhlmann- Anderson Intelligence 
Test (2) and the California Test of Mental Maturi 
(their general reading ability Сашогша ne 
Test (7) and Durrell-Sullivan Reading Test (1)], and 
their socioeconomic status [Warner'sIndex of Status 
Characteristics (8)]. 


Procedures 


Measurements; Each of the subjects completed 
three attitude questionnaires. Seoreson a modifica- 
tion of Remmers' Scale for Measuring Attitude to- 
Ward Any School Subjec were interpreted as at- 
titudes toward reading às a school activity. Scores 
on Tenenbaum's School Attitudes Questionnaire (5) 
were interpreted as attitudes towar school, class- 
mates and teacher. The combined scores from a 
modified form of Thorndike's Fictitious Annotated 
Titles Questionnaire (6) plus the number ranking , 
depending on like or dislike, given to four different 
Content types of reading material actually read were 
interpreted as attitudes toward these types of stories. 
These were 1) boys’ sports stories, 2) girl's mild 
adventure stories, 3)airplane or flying stories, and 
4) manners or social relations stories. 

After completing the three attitude questionnaires, 
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each of the subjects read four original experimental 
reading passages, the content types of which were 
identified as closely as possible with the content type 
of stories toward which attitudes had been expressed. 
The reading difficulty and length of these passages 
were controlled to provide four passages of approxi- 
mate reading difficulty and approximate length. 

After reading each passage the subjects completed 
an experimental reading test consisting of reading 
for immediate recall and of critical reading. The 
reading testfor each of the four passages consisted 
of eight multiple-choice questions on recall and 
twenty -four multiple-choice questions oncritical 
reading. The critical reading questions consisted of 
eight on sensing the organization of the passage, 
eight on making inferences and eight on drawing con- 
clusions. The validity of the items in these experi- 
mental reading tests was determined from other re- 
search on critical reading measurement, from analy- 
sis of reading abilities, from the criticisms of a 
jury of faculty and of graduate students enrolled at 
the time with the writer in agraduate education course 
on children’s thinking, and from the results of a pilot 
study of the experimental reading tests. Evidence of 
the reliability of the experimental reading tests was 
obtained by calculating coefficients of correlation 
between the scores from two administrations of the 
tests given approximately two months apart. These 
correlations ranged from .77 to .84 which closely 
approximated the accepted minimum standards for 
group prediction. The tests, therefore, were judged 
to be reliable. 

Analysis of the data: The statistical procedures 
alae inanalyzing the data gathered were, first, 
to observe for significant differences the mean scores 
of the top twenty-five percent, the middle fifty per- 
cent, and the bottom twenty-five percent of the total 
population and the sex groups on each of the vari- 
ables measured. For example, the subjects were ar- 
ranged into three groups according to their scores 
made on the measurement of attitude toward a part- 
icular content type of reading material. A comparison 
was then todetermine if the differences between the 
mean scores of the different groups were significant. 
A second procedure obtainedthe coefficients of cor- 


relation between all the experimental and personal 
variables inthe study. The data from the sex groups 
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were intercorrelated as well as the data fromthe 
total group. (See Tables 1, 2 and 3.) 


Results 


The conclusions that seemed probable in the 
light of the evidence gathered were as follows: 


ds The major hypothesis of the study seemed 
to havebeen borne out. The correlations between 
attitudes toward content typesof materials and 

critical reading test scores on these types of 
reading materials ranged from .23 to .50 and 
were all significant. The recall reading scores 
were not so closely related to attitudes express- 
ed toward the material read. The correlations 
hereranged from -.12 to .3£, with most of the 
correlations below.15. Breaking the total group 
into sex groups seemed toheightenthe relation- 
ships between the attitudes toward content types 
of materials and critical reading scores. For 
example, the girls’ attitudes toward manners 
stories were more highly correlated to critical 
reading scores (. 50) than were the boys’ reading 
scores and attitudes toward this passage (. 23). 

Further substantiation of this relationship was 
shown when forty-eight comparisons were made 
of the subjects whose scores on the measures 
of attitude toward content typesof reading mat- 

erial were in the lower twenty-five percent, in 
the middle fifty percent, and inthe upper twenty- 

fivepercent. On thirty of these comparisons the 

differences in the mean critical reading scores 
of these groups was significant. All of these 
significant differences were in favor of the group 
with the higher attitude score. 

2. It appeared likely that the relationships be- 
tween attitudes toward reading as a school ac- 
tivity and the critical reading scores were much 
the same as the relationships between attitudes 
toward content types of reading materials and 
the critical reading scores. 

3. The evidence suggested that the relationship 
between general reading ability and attitude to- 
ward reading as a school activity approacheda 
substantial level, with correlations ranging 
from .34 to . 45. 

4. The relationships between gene ral reading 
ability and the attitudes towardthe four content 
types of reading material was negligible for the 
total group. Itbecame significant when the tot al 
group was divided into sex groups,however. 
(See Tables 1, 2 and 3). 

5. Substantial negative correlations were indica- 
ted between intelligence and chronological age: 
-.48 to -.52. 

6. A substantial positive relationship was indi- 
cated between intelligence and general reading 


ability as shown by correlations ranging from 
.38 to . 52. 

7. A small relationship was indicated between 
intelligence and socioeconomic status as shown 
by correlations ranging from . 36 to . 37. 

8. Negligible relationships were indicated be- 
tween (а) intelligence and the attitudes measured, 
(c) socioeconomic status and the attitudes meas- 
ured, and (d) critical reading scores and atti- 
tudes toward school, classmates and teacher. 

9. Differences between boys and girls were seen 
to be consistent throughout the study. (See Tables 
1, 2 and 3). 


Summary 


This study suggested that the reading compre- 
hension of an individual child as he reads is influ- 
enced to a degree by his attitude toward the content 
type of material being read. This seems more likely 
to be true if he is asked to reason, or to read be- 
yond the material, rather than if he is just asked to 
repeat verbatim. The study pointed up the significant 
differences in attitudes toward reading due to sex 
characteristics. It indicated that attitude toward 
reading as aschool activity has an important enough 
effect onthe other variables of the study to warrant 
serious consideration. 
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TABLE I 


` INTERCORRELATIONS OF THE EXPERIMENTAL AND RELATED FACTORS OF THE TOTAL GROUP 


AG AA AM IQ CA RA SS CB CG CA CM 
AB .01 .4T* .26* .01 DLL .12** .00 .39* .18* .28* .22* 
AG -.09 .63* .11 .04 .10 -.06 .00 .33* -.14жж .26* 
AA .li** -.06 .08 .09 -.03 .05 -.25% .32* -.06 
AM .05 .12** . 02 -.11 .04 .26* -.05 „37% 
IQ -.94* .45% .35* .48* .55* .51 .51% 
СА .09 -.19* .20* .20* .25 .2T7* 
RA .26* Fa 8 .70* .T3* . 69% о 
a 
SS Тее таже gee .12** = 
е] 
СВ .TT* .80* .81* 
CG .69* .85* 
CA .76* 
*Significant at the . 01 level of confidence. 
**Signivicant at the . 05 level of confidence; all others not significant 
Interpretation of symbols: SS - socioeconomic status 
AB - attitude toward boys’ sports stories CB -critical reading scores in the boy's sports 
AG - attitude toward girls’ mild adventure stories passage 
AA - attitude toward airplane stories CG -critical reading scores in the girl's mild 
AM - attitude toward manners stories : adventure passage 
IQ - intelligence СА - critical reading scores in the airplane pas- 
CA - chronological age sage 
RA - general reading ability CM- critical reading scores in the manners pas- 
age 
M 
сә 
= 
e 
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TABLE II 


INTERCORRELA TIONS OF THE EXPERIMENTAL AND RELATED FACTORS OF THE BOY GROUP 


AG AA AM IQ CA RA SS CB сс СА см 
АВ 11 44* 24* .20** — .10 .31* 14 43* 26% 36% .29* 
AG 22219 64* -.04 -.02 -.18** -.15 -.15 24 -.21% 07 = 
AA 00 .21 -.05 21 12 15 01 3T 03 Е 
АМ -.03 St -.17%% -.19** -.06 .13 -.12 .23 5 
IQ -.52 .38* .36* .48% .43* .50* .41* = 
CA .i9** -.19**  .17** .22* .26* 131% 6 
ВА .25* 2% .56* .80* .65* E 
a 
58 -14 .04 EU .05 > 
св 278% 287% .85* g 
CG . 69* .83* E 
Ei 
CA 177% Б 
*Significant at the . 01 level of confidence а 


**Significant at the . 05 level of confidence; all others are not significant 


Interpretation of symbols: 
AB - attitude toward boys' sports stories SS - socioeconomic status 
attitude toward girls’ mild adventure stories CB -critical reading scores in the boys’ 
attitude toward airplane stories sports passage 
AM -attitude toward manners stories CG -critical reading scores in the girls’ 
intelligence mild adventure passage 

chronological age CA -critical reading scores in the air- 
RA - general reading ability plane passage 

CM -critical reading scores in the manners 

passage 


>> 
>Q 
ба 


05 
ЖҮ 


TABLE III 


INTERCORRELATION OF THE EXPERIMENTAL AND RELATED FACTORS OF THE GIRL GROUP 


AG AA AM IQ CA RA SS CB ca CA CM 

AB 26* 42* .32* -.10 12 04 -.11 39* .08 .21% 11 
АС 00 .56% 13 .13 28* 03 21** .43* .18** 40* 

AA 24* -.13 12 -.18** -.15 00 -.35ж 26% -.07 
АМ .14 .15 .16** 01 21** .34* 22* 50* 
IQ .48 52* .97* 50* 62* 54* 65* 
CA 08 -.18** 25* .22* 25% 27% 
RA .29% 4172ж .85% .T1* .76* 
SS 13 24* .12 23* 
CB 78* TUR 81* 
CG 11% .87% 
СА 12ж 


*Significant at the . 01 level of confidence 
**Significant at the . 05 level of confidence; all others not significant 


Interpretation of symbols: 
AB - attitude toward boys' sports stories 
AG - attitude toward girls’ mild adventure stories 
AA - attitude toward airplane stories 
AM - attitude toward manners stories 
IQ - intelligence : 
CA - chronological age 
RA - general reading ability 


SS - socioeconomic status 

CB-critical reading scores in the boys’ sports 
passage 

CG -critical reading scores in the girs' mild ad- 
venture passage 

CA - critical reading scores in the airplane passage 

CM - critical reading scores inthe manners passage 
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FIELD NEWS Continued.... 


inthe report: 

1. The Present Score: Practices inthe Teaching of Handwriting in the United 
States, 1960. 

2. For Easy and Legible Handwriting. e. 

3. The Relationship Between Handwriting Pressures and Legibility of Hand- 
writing in Children and Adolescents. 

4. Perception and Skill Behavior. 

5. Children's Perception of the Handwriting Task. : 

6. A Comparative Study of the Legibility of Handwriting of 454 Adults Trained 
in Three Handwriting Styles: АП Cursive, All Manuscript, АП Manuscript- 
Cursive. 

"I. The Evaluation of Quality in Handwriting. 

Further information on this report may beobtainedfrom the Foundation at 1426 

G Street, NW, Washington 5, D.C. 


жжжж 


А. 7. KING, PRESIDENT OF NATIONAL ANALYSTS, INC., hasannounced that the con- 
fidentiality of respondents in interview research studies has been given the рго- 
tection of a judicial body. In an FTC hearing, National Analysts reported 
statistical data from a sample survey it had conducted, and the opposition counsel 
demanded the list of names and addresses of all respondents in the study. Na- 
tional Analysts cited the following principles set out by the American Psycho- 
logical Association (Principle 4. 32-1): ** Theidentity of research subjects must 
not be revealed without explicit permission. If data are published without per- 
mission for identification, the psychologist is responsible for adequately dis- 
guising their source.’’ National Analysts indicatedfurtherthat their professional 
staff would be forced to resignfrom their organization if the names and addres- 
ses of the respondents were disclosed and, second, that such disclosure would 
ultimately result in the community's refusal to participate in research studies 
since they might be subject to harrassment by lawyers and subject to subpoena 
incourt. The hearing examiner ruled that National Analysts is not required to 
disclose the nam es of respondents. This information should be useful to re- 
search workers conducting interview studies by helping to obtain the coopera- 
tion of reluctant respondents as well as in countering requests for access to 
confidential data. 


жжжж 


LINDLEY J. STILES, DEAN OF THE SCHOOL OF EDUCATION OF THE UNIVERSITY 
OF WISCONSIN, recently has disseminateda statement on the contributions and 
next steps in the Cooperative ResearchProgram. Dr. Stiles points out that re- 
search, both basic and applied, is the key to school improvement. The 83rd 
Congress, in1954, passed Public Law 531 which authorized the Commissioner 
of Education “Фо enter into contracts and jointly financed cooperative arrange- 
ments with universities and colleges and state educational agencies for the con- 
duct of research, surveys, and demonstrations in the field of education." Under 
this law the first appropriation was made in fiscal 1957. The first appropria- 
tion was almost one million dollars. The following year the budget was in- 
creased to 2.3 million dollars. For fiscal 1962, it was raised from 3.4 to 5 
million dollars. These appr opriations are small as they are compared to the 
160 million dollars appropriated for research in agriculture and almost half a 
billion dollars allocated for health research last year. The cooperative research 
program has demonstrated the value of basic research as a foundation for 
school improvement. It now stands ready to move ahead to accelerate basic 
research discoveries into educational programs. The following areas illustrate 
the kinds of topics conducted under the Cooperative Research Program: 


1. The nature of creative ability. How such talents may be identified and de- 
veloped through education. 


. .. Continued on page 330 
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Retention and Transfer in Children of Low, 


Average, and High Intelligence 


HERBERT J. KLAUSMEIER 


University of Wisconsin 


THE LACK OF comprehensive knowledge about 
learning phenomena in classroom settings requires 
experimentation with human subjects in school set- 
tings. Further, since education is concerned with 
Socially useful learnings, the results of laboratory 
experiments which employ nonsense rather than 
meaningful learning materials must be reexamined 
for their applicability to classroom situations. In 
this study, retentionandtransfer abilities were com- 
pared among children of low, average, and high in- 
telligence. Each child learned to solve a socially 
useful problem, graded in difficulty to his present 
achievement level; and subsequent measures of re- 
tention and transfer were secured and analyzed. 


Procedure 


Subjects Тһе subjects were 20 boys and 20 girls of 
low intelligence (WISC IQ 50-80), 20 boys and 20 
girls of average intelligence (WISC 10 90-110), and 
20 boys and 20 girls of high intelligence (WISC IQ 
120-146). Thelow IQ children were drawn from four- 
teen special classrooms for mentally retarded edu- 
cable children. Any low IQ child who showed char- 
acteristics of organic involvements such as mongol- 
ism, or who was severely handicapped orthopedically 
or in hearing, vision, or speech was excludedfrom 
the study. Similarly, no average or high IQ child 
exhibited any serious physical defect. The average 
and high IQ children were drawn from fifteen regular 
fifth-grade classrooms. 

All children had birthdates between September 15, 
1947, and December 15, 1948. The mean age of chil- 
dren at the midpoint of this study was 131 months. 


a eee 


* The research reported herein was perfo 
tion, Department of Health, Education a 


JOHN CHECK 


University of Michigan, Flint College 


Method A group of twenty-nine problems at three 
main levels of difficulty was arranged, one difficulty 
level foreachIQgroup. After extensive experiment- 
ing with pupils in four fifth-grade classrooms and 
with a small group of low IQ children, none of whom 
were subjects of the study, the problems for low IQ 
children were further subdivided with the lowest level 
intended for children who did not know the number of 
centsina dime or who could not do simple addition. 
A reserve level of problems of high difficulty was 
established for any high IQ child who might solve the 
assigned problem in less than three minutes. The 
assignment of problems within each difficulty level 
to individual children was made randomly, in order 
to carry outa combined retention and transfer design, 
described later. 

Table I shows the number of children whose final 
problem was at each level, and it gives a sample 
problem ateachdifficulty level. Eleven low IQ boys 
and thirteen low IQ girls performed at the lowest 
level of difficulty, and one of three problems at this 
level was to make eight cents with four coins. One 
high IQ boy and one high IQ girl were at the R, re- 
serve level of difficulty. All problemsin Level A re- 
quired making a specified amount of money with the 
least possible number of coins; all problems in other 
levels requiredusing alarger number of coins; e.g. 
Fifty-seven cents with nine coins. The average and 
high IQ children used only paper and pencil in solving 
the problem, however, actual coins were used by the 
low IQ children. 

In the teaching-learning situation, the examiner 
demonstrated how to make a specified amount of 
money witha specified number of coins, such as six 


rmed pursuant to a co ntract with the United States Office of Educa- 
nd Welfare. 
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cents with two coins. Following the demonstration, 
the examiner presentedthe child his assigned prob- 
lem and started the stop watch. If the child solved 
the problem in less than three minutes, he was given 
a problem at the next higher level and time was 
started over. Cues or helps were provided at the end 
of six, nine, and twelve minutes according to a pre- 
arranged schedule inorder to make certain that each 
child reached a solution to the problem in fifteen 
minutes. By design, the possible range of time to 
complete the problem was between three and fifteen 
minutes. 

Immediately after the solution of the problem, a 
five-minute period of controlled interpolated activity 
Started during which the child listened to a recorded 
Story and severalsongs. Thenthe original problems 
were presented againto 60 of the children (retention 
group) and the time to solve and the cue level were 
noted. Seven weekslater, the same task with identical 
directions was presented to the 60 children in the re- 
tention group. Now follows a description of how a 
transfer group of 60 children were treated. 

The 120 children were randomly divided into two 
groups, one designated a retention group and the 
other a transfer group. Thirty boys and thirty girls, 
ten from each IQ level, were randomly drawn and 
assigned transfer problems. Each of these 30 boys 
and 30 girls was assigneda different problem but at 
the same level of difficulty as he had originally 
learned. For the five-minute transfer measure then, 
the 60 children designated as transfer subjects re- 
ceived different problems, whereas the 60 children 
designated as the retention Subjects were given the 
identical problems they had learned originally. Sim- 
ilarly, for the seven-week retention measure the 
retention subjects solved the same problems as be- 
fore; whereas the transfer Subjects were assigned 
new problems, 


Results 


The design of this Study required the problems 
to be graded to each child's achievement level. The 
means and standard deviations of the acquisition, 
five-minute, and seven-week retention and transfer 
scores in seconds are given in Table П. The mean 
time-to-learn for the low, average, and high IQ re- 
tention groups was much alike---569, 590, and 542 
seconds, respectively; the difference among the IQ 
;roups was not significant at the . 05 level. 

The results of the four analyses of covariance 
Edwards, 1950) of the acquisition and the retention 
г transfer measures with acquisition the covariate 
ire shown in Table III. The largest F of 2.13 does 
iot approach significance at the .05 level. Thus, 
/hether the relationship is between acquisition and 
elearning or between acquistition and transfer, the 
ifference among the three IQ groups is not statistic- 
lly significant at the . 05 level. This same conclu- 
ion applies to the intervals of five minutes and 
^ven weeks after original learning. Further, the 


small differences, which are not statistically signif- 
icant, areconsideredofno practical significanc e 
for educational practice; that is, when children of 
low, average, andhigh intelligence receive learning 
tasks graded appropriately to their levels of achieve- 
ment, they retain and transfer equally well to new 
situations of appropriate difficulty. 


Discussion 


The present design to deal with retention and 
transfer differs markedly from any previous one the 
researchers have encountered. The main differences 
are that in the present study, the subjects repre - 
sented three distinct IQ levels, only Socially useful 
arithmetic learnings were used, and the learning 
tasks were graded to the child's present achievement 
level. Theresearchers believe that studies of reten- 
tion conducted over short time spans, or basedon the 
Subjects’ learning meaningless material as reported 
through the years by Ebbinghaus (1885), Luh (1922), 
Postman (1948), and Archer (1953) offer little for 
understanding retention in classroom learning situ- 
ations or in human beings generally. 

Our purpose was in line with Ausubel’s proposal 
(1953) that researchers deal ing with educational 
problems should direct their efforts toward clarifying 
the nature of thelonger-term acquisition and reten- 
tion of meaningful outcomes and methods of learning. 
And since socially useful arithmetic learnings were 
used rather than nonsense material S, the usual meth- 
ods ofmeasuring retention of nonsense material 
Such as reproduction, recall, recognition, and an- 
ticipation were not applicable. 

Inthe present study, thechild could practice be- 
tween the five-minute and seven-week test, if he re- 
called the problem. Thus, some efficiency may 
have beenlost in the presentdesign if the use of non- 
sense material actually prevents practice between 
acquisition and subsequent securing of retention or 
transfer measures. However, during the five-min- 
ute interval between acquisition and the first re- 
tention or transfer measure, the researchers had 
the child under their jurisdiction and kept him en- 
gaged in activities unrelated to the task. Since the 
results of the study werethe same for the five-minute 
and seven-week interval after acquisition, the re- 
Searchers assume that if practice occurred,its in- 
cidence and effects were equal among the three IQ 
groups. 

Speed of learning was not a critical element of the 
design except within the practical timelimit imposed 
to complete the teaching-learning sequence with each 
child. Rather than emphasizing Speed, the research- 
ers graded the tasks so that at the three IQ levels 
children wouldlearnthe tasks equally well. In turn, 
the results obtained are not directly comparable with 
those of many previous studies such as Gillette (1936) 
Leavitt (1945), Gilliland (1948), Zimny (1949), Un- 
derwood (1954), and Rustebakke (1954). The incon- 
clusive findings of these six investigators about re- 


KLAUSMEIER - CHECK 


2 TABLE I 


NUMBER OF CHILDREN WHOSE PROBLEM WAS AT EACH LEVEL OF DIFFICULTY 


Level of Amount No. of Coins and/or Low IQ Average IQ High IQ 
Difficulty Required Bills to be Used Boys Girls Boys Girls Boys Girls 
А-1 8£ 4 11 13 
А-2 13¢ 4 2 5 
A-3 476 5 T 2 
B 574 9 18 19 
с 1.97 9 2 1 184 19 
R* $14.53 16 1 1 


* Level R was prepared for high IQ children who solved their originally assigned problem atlevel C 
in less than three minutes. 


TABLE П 


MEANS AND STANDARD DEVIATIONS OF THE TIME-TO-LEARN, FIVE-MINUTE 
AND SEVEN-WEEK RETENTION AND TRANSFER SCORES 


Time-to-Learn 5-Minute Retention 7-Week Retention 
in seconds in seconds in seconds 
N "Mea S.D. "Mean 80. ean 

Retention Low 20 569.20 256.77 107.55 214.14 189.00 246.71 
Average 20 590.30 252.31 84.70 183.16 362.75 298. 42 

High 20 541.70 238.44 145.85 241.67 233.75 212.47 

Transfer Low 20 679.15 146.73 501.25 303.01 415.90 287.72 
Average 20 665.00 252.35 451.95 292.26 467.35 293.00 

High 20 615.25 246.83 494. 95 310.34 378.40 275.92 

Method Retention 60 567.06 249.29 112. 69 214.32 261.83 254.98 
Transfer 60 653.13 220.70 484.71 301.05 420.55 285.64 
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TABLE III 


ANALYSIS OF COVARIANCE FOR ACQUISITION, FIVE-MINUTE, AND 
SEVEN-WEEK RETENTION AND TRANSFER 


Analysis Source of Sum of Squares of Mean Significance 
Area Variation Errors of Estimate — df Square F ot F 
Acquisition — Adjusted means 59326.3 2 29913.1 61 P .05 
5-Міп. Ret. Within groups 2486921.3 56 44400. 3 

Total 2546147. 6 58 
Acquisition — Adjusted means 286115.8 2  14307.9 2.15 P .05 
7-0%. Ret. ^ Within groups 3141802. 6 56 66925. 0 

Total 4033918. 4 58 
Acquisition — Adjusted means 26032.1 2 13000.0 112 P .05 
5-Min. Trans- Within groups 5936295. 8 56 106005.2 
fer 

Total 5962327. 9 58 
Acquisition — Adjusted means 78010.0 2 39035.0 — .42 P .05 
T-Wk.Trans- Within groups 5173764.0 56 92388. 6 
fer 

Total 5251834.0 58 


tention in fast and slow learners may have occurred 
because the same task, rather than gradedtasks, 
was used with all subjects when originally identifying 
their fast and slow learners. 

In this study, no difference of significance was 
found in retention of arithmetic learnings in children 
of low, average, and highintelligence with thelearn- 
ing task graded to each child's achievement level. 
Wouldthe same results have been obtained in another 
area of learning in the cognitive domain, for example 
vocabulary, reading, science concepts, or social 
studies concepts? The researchers think so. They 
have high confidence inthe results and suggest that 
one of the most important implications of the study 
is thatclassroom teachers should find out the pres- 
ent achievement level of the child. Upon ascertaining 
this and knowing learnings at next higher levels of 
difficulty, the teacher should offer individualized 
instruction or arrange the children in smaller groups 
as necessary to provide instruction which will en- 
courage efficient acquisition and subsequent reten- 

Children of low (55-80 IQ), average (90-110 IQ), 

and high intelligence (120-146 IQ), at131 months of 
age did not differ significantly in relearning and 
transfer abilities when the arithmetic problem to be 
learned was graded to the learner's achi evement 
level. In both relearning and transfer and at time 
intervals offive minutes and seven weeks, no differ- 
ence among the three IQ groups was found to be sta- 
tistically significant at the .05 level. Implications 
of this conclusion for research design and teaching 
practices were presented. 
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An Inquiry Into the Relationship of Achievement Scores 
And Certain Variables in an Adult High School 


THE PURPOSE of this study was toascertain 
whether motivation, whichispresumedin adults who 
voluntarily continue their education below college 
level, has an effect on achievement scores. A sec- 
ondary purpose was to determine if certain variables 
of status and race had any significance. 


Specific Hypotheses Tested in the Study 


The study set up hypotheses to be tested by sta- 
tistical comparisons and correlations. The study 
was made in a private high school for adults in Cleve- 
land, Ohio, where the high school program was ap- 
proved by the State of Ohio. The study included 100 
Subjects, male and female, whose age ranged from 
16--48 years, with an age mean of 22.31 years. It 
excluded those who were taking courses purely for 
economic reasons or otherwise. 


Hypothesis І: That there was no relation- 
ship between achievement scores and 
previous education. 


Hypothesis II: That some relationship 
between achievement scores and schol- 
astic ability existed within sub-g roups 
peculiar to this population. 


Definition of Terms, Procedures, and Findings 


In the school under study, at the completion of 
the program prescribed, a student may earn a high 
School diploma certifying that he has finished 16 u- 
nits of high school credit, each unit consisting of 120 
class hours of instruction. The majority of students 
come with some high school units previously earned 
elsewhere. In this study, these earlier units are 
termed **previous education. ’” 

The school year is divided into four three-month 
terms. The achievement scores were those earned 
On final tests and other course work, representing 
final grades for one three-month term. 


Findings relative to Hypothesis I 


i т §.E.ofr 
Previous education Mean Sena 


(units) 8.265 4.56 
Achievement scores 78.70 8.46 .035  .099 


HENRIETTA H. CROUCH 
Cleveland, Ohio 


HypothesisIis supported by the data obtained since 
the relationship of .035 is not significant. 

The scholastic ability was determined by scores 
on the Army General Classification Test (AGCT). A 
relationship wasfirstestimatedfor the entire group 
of 100, between achievement scores and scholastic 
ability as measured by the AGCT. These were the 
findings: 


Mean Sigma r S Eof r 
AGCT 105.30 16.07 
Achievement 78.70 8.46 .16 . 097 
scores 


This relationship of .16 is not significant. Be- 
cause the sample included those between the ages of 
16 and 18 years who are ‘‘forced’’, either by state 
law and/or parental pressure, to go to school, the 
sample was further subdivided into two sub-groups, 
those 18 years or younger, and those 19 years or 
older to see if the motivation which is presumed in 
the older group can be statistically proved. These 
findings are shown at the top of the next page. 

The above 23 percent of the population 16--18 
years old represents current drop-outs from local 
schools for reasons of illness, bad conduct, and 
other personal maladjustments. As mentioned, they 
are forced to go to school by parents or by virtue of 
the state law, and thislow motivation may be one of 
the factors which interferes with a showing of higher 
correlation between aptitude and achievement. The 

05 relationship is not significant. 

In validating the AGCT, success in training was 
the most commonly used criterion. The lowest re- 
corded correlation with academic grades for 119 
trainees of a similar mean, 107.01, was . 26 (Ref- 
erence No. 19, pp. 128-129). The only other low 
correlations are reported in specialized college 
courses and most West Point courses as ranging 
from .12 to.40, although correlations with grades 
in English and Mathematics at West Point were . 40 
and .43. Super quotes E. K. Strong, Jr. to the effect 
that undermined motivation is the reason for the lower 
correlations in the specializedtraining since a sub- 
stantial number of men hadlittle genuine interest in 
this training. 

The correlation of . 23 for the 77 percent of pop- 
ulation 19--48 years oldis significant at the .05 level 
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N:23 - 16-18 years old - Mean 
AGOTE ЗА а н с Т 06802 
Achievement всогев....... d 76.35 
N:77 - 19-48 years old - Mean 
AGO е ah aac utente eee 104. 935 
Achievement scores. ....... 79.40 


Critical ratio between r’s 


19-48 years. . . . . .23 
16-18 years. . . .. .05 
Difference .18 C.R. 


Sigma r S.E. ofr 
14. 92 

8. 98 .05 213 
Sigma r S.E. of r 
16. 40 

8.13 .23 108 


. 735 - not significant 


eee 


of confidence. Itis a low degree of correlation, yet 
stable enough that such a measure would occur by 
chance 5 times or less in 100 such comparisons. It 
is known that they have been motivatedtoa peak-level 
of achievement by these factors; refusal of a job, lack 
of promotion, teachers’ prodding, subject-content of 
courses, inadequacy in comparison to peers, and 
others, However, a tempering factor is the inter- 
ference of work time with study time. This may in- 
clude poor time-budgeting, lack of discipline and 
effective study habits, as well as overwhelming fi- 
nancial responsibilities. There are findings by Cal- 
vin et al. and Leestma (References 2 and 7) which 
include lack of self-confidence, student fatigue, 
“‘rustiness’’, lack of interest in background and the- 
ory. 

The sample of 100 was further subdivided into 
dichotomous sub- groups: 


Married - Single 
Veterans - Non-veterans 
White - Negro 


The findings on the first two sub-groups follow: 


The t-ratios and chi-squares indicate that there 
is no real divergence within these sub-groups; mar- 
ried-single, veterans - non-veterans. 

The findings onthe white and Negro students will 
be found on the top of the next page. 

The critical ratios of the differences inboth ap- 
titude and achievement between white and Negro 
students are very significant, beyond the .01 level 
of confidence. Within the group of 100 subjects under 
study, therefore, the null hypothesis is not tenable 
at the .01 level of significance. 


Interpretation 


The full significance of these figures is lessened 
by the inadequacy of the sample: 100 subjects. How- 
ever, for a random sampling in the city of Cleve- 
land in 1959, the ratio of 71 white, 29Negro is a 
good percentage as compared with other population 
figures which estimate the Negro population in south- 
ern and border states as 20.6 percent. The last re- 
liable census figures for the city of Cleveland were 
available only for 1950, at which time it was 16.16 
percent. 


Sub-group : Number Sigma t-ratio Chi-square 
AGCT 


Single, Pte UT 17 17. 04 
Married. M ЕА 23 11.51 0.1136 
Achievement 
Single лат Т NEUE we 8.535 
Married о 23 8.185 0. 3161 .0291 


ROH: Steere ph) a eum ye Se Se LS 


VeLeranga а уы, TEM Bo 45 13. 86 
Non-veterans ....... 55 17.73 0,1677 
Achievement 
Veterans eorr sa ote T В 45 H 8.87 
Non-veterans......... 55 78.545 8. 26 0.0059 . 0004 
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AGCT 
Уе ае ален) 108.17 16.095 
Negro. УИН 29 98.28 13.63 : 3.110 
Achievement 
һе...» е «E 80. 38 8.30 
Negro =i. lcis ШШ М . 29 14.59 1.38 3.49 14. 66 


A majority of the Negroes in the school under 
study have received their previous educationin 
Cleveland public schools. The distribution is as fol- 
lows: 


From schools in Cleveland 18 
From schools outside the city 2 
From schools outside the state 9 

25 


The disparaging conclusions which might be drawn 
is lessened by the knowledge of some civic factors 
such as: the two living areas of the Negroes were 
rated the most dangerous in regard to crimes, al- 
though there is visible an intelligent attempt at this 
problem-solution. Also, the Negro students in this 
Study are only three or four generations removed 
from the illiterate, slavery status of their ancestors. 
The white students, without exception, have a legacy 
Of literacy and freedom for at least this many gen- 
erations, if not further. 

One of the chief interpretations in regard to the 
Negroes in this study is that they have done more, 
achievement-wise, with less aptitude. 

Although differences within the sub-groups, ех- 
Cluding the white and Negro, are not significant, 
there is a slight variation in their hierarchy of dif- 
erences; 


Aptitude Critical Ratio 
Young - older 0. 4427 
Veterans - non veterans 0. 3413 
Single - Married 0.2081 
Achievement. 
Older - young 1.48 
Married - single 0.4736 
Veterans - non veterans 0.008 
Follow-up 


Further follow-up was conducted one year later 
Ю determine drop-outs and completions. There is 
no Significant difference in the percentages of the 
Young and older students graduated, dropped out, ог 
Temaining. Judging from the mean units of previous 
Education, the expected number, 51 percent, were 
Sraduated within one year; those remaining repre- 
Sented 15 percent; those whodropped out, 34 percent 


distributed as follows: 


16--18 years old: 22 percent 
19--48 years old: 12 percent 
Critical ratio .44 


This differenceis notsignificant, but of the crit- 
ical ratios inthe three categories: graduates, drop- 
outs, remaining, itcan be considered the least neg- 
ative. 


Over- and Under-Achievers 


The study revealed only an insignificant number. 
There were 4 over-achievers and 4 under-achiev ers. 
Complete statistical data would be valuable only for 
individuali counseling. 


Achieve- 
Partial datafollow: Means:Age АССТ ment 
4 Over-achievers 5 К Б 
4 Under-achievers 22 115 67.25 


1. The study of 100 subjects, with an age mean 
of 22.31 years, showed that length of previous ed- 
ucation showed very little correlation with achieve- 
ment scores. 

2. The hypothesis that age is a minor factor would 
bear more investigation since this study proved that 
correlation between aptitude as measured by the 
AGCT, and achievement scores for 77 subjects, 19 
to 48 yearsof age, showed a slight and more depend- 
able increase over that of a group which included 
23 percent of younger students. 

3. Since the correlation between aptitude and 
achievement for students 16 to 18 years of age was 
lower than expected from other reliable data on the 
AGCT, these findings would suggest that other fac- 
tors besides intelligence were more important. 

4. Because of the small sampling; no generaliz- 
ations involving any sub-groups can be made. But 
within the limitsofthis study, there were no signif- 
jcant differences between the single and married, 
veterans and non-veterans. 

5. In the case of the white and Negro students , 
the conclusion that can be drawn is that in a random 
sampling of one hundred students where the Negro 
was fairly represented, in an industrial city with 
many cultural advantages, the Negro population 
showed a decided inadequacy when compared with a 
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white populationin scholastic aptitude and scholas- 
tic achievement. But by an interpretation of contrib- 
uting factors, it may be stated that the Negro popu- 
lationinthis study has done more, achievement-wise, 
with less aptitude. 


18 
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Dangers Inherent in Correlating Averages 


OVER A PERIOD of years many studies inedu- 
cation and psychology have been conducted in which 
Statistical techniques involving the correlation of 
averages have been employed. Few basic statistical 
texts have devoted sufficient coverage to the dangers 
inherent in the use of this technique. A survey of ten 
basic statistics texts indicated that seven of the ten 
texts included no cautions concerning the correlating 
of averages. One of the texts pointed out the possi- 
bility of obtaining coefficients that were spuriously 
high (2), and two texts emphasized the fact that the 
correlation among means might actually be reversed 
5$ sign from the correlations within the samples (3, 
4). 

Garrett (2), in his treatment of the correlation of 
averages, cites correlational studies containing state 
oroccupational averages as illustrations of spurious- 
ly high correlations resulting from the canceling out 
of factors which ordinarily reduce correl ations. 
There is ample justification for the assumption that, 
in general, correlations among samples tend to re- 
sult in higher coefficients than correlations with in- 
dividual scores. It is important to note, however, 
that with certain variables in education the reverse 
tendency might prevail. This would be particularly 
true іп correlational studies involving teachers’ 
marks. Teachers seemto have a tendency to utilize 
the class as aframeof reference, sothat in a school 
System in which traditional grading practices are 
employed, a boy of average intelligence in a bright 
Class will not fare nearly so well as does his counter- 
part in a dull class. Under ordinary circumstances 
Such relativism in grading practices would lead to a 
Situation in which thecorrelations within classes 
would be of greater magnitude than the correlation 
among the classes. This would be particularly true 
in school systems which feature homogeneous group- 
ing. 

There is also a distinct possibility that although 
correlations within classes are positive and high that 
the correlation among theclasses might be negative. 
Lindquist (4) in his discussion concerning the possi- 
ble reasons for the correlation among the means 
being negative whilecorrelations within the samples 
аге positive and high, states that such an occurance 
would be very rare. If the possibility of sucha phen- 
omenon occuring is as rare as Lindquist assumes, 
there would be littie need for extended coverage of 
Й eee 
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this topic. However, there іѕ геазоп to suspect that 
this phenomenonis not too uncommon in the area of 
reporting or marking studies. Research findings in- 
dicate that teachers with certain attitudes are much 
more prone to fail students than are other teachers 
(6). Myriad investigations have concluded that there 
is a significant positive correlation between sc hool 
grades and 1. Q. (7). Thus, thereis the distinct pos- 
Sibility that, when teachers who are harder markers 
have brighter classes, there will be a positive cor- 
relation between grades and I. Q. within each class, 
and a negative correlation among the means of the 
classes. Guilford (3) hasgiven an excellent illustra- 
tion of this phenomenon with actual data. 

The correlation of averages is particularly ap- 
pealing when one of the variables within the sample 
is dichotomous. Many investigators have calculated 
fractional averages for groups, and then employed 
these averages in determining product-moment со- 
efficients of correlation rather than calculating bi- 
serial correlations. An illustration of this would be 
a correlation between the fractional number of fail- 
ures in a class and the average I. Q. of the class. 
Yet in this area where correlation of averages is 
mostappealing, itis most susceptible toerror. This 
is particularly true when the division point ofthe 
dichotomy is not close to 50 percent of the sample 
and there are marked differencesin the homogeneity 
of the samples. 

Inthe elementary schools today there is a distinct 
trend toward marking on an ‘‘individualized’’ basis 
(5,9). Marking on an ‘‘individualized’’ basis involves 
a comparison of the accomplishment of the student 
in relation to his ability (1). This type of appraisal 
of student progress supposedly does not rate the 
quality of an individual's performance in relation to 
some groupnorm or standard. It is common practice 
in a program of thiskind to rate the performance of 
a student either satisfactory or unsatisfactory. 

The present investigation concerning the dangers 
inherent inthe use of statistical techniques involving 
the correlation of averages was an outgrowth of a 
study examining the relationship of I.Q. to satisfac- 
tory or unsatisfactory gradesina district which had 
adopted an ‘‘individualized’’ program of reporting 
pupil progress. · 

In the current study the sample was limited to 
fourth and sixth grade students in the public elemen- 
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TABLE I 


THE NUMBERS AND PERCENTAGES OF BOYS AND GIRLS RECEIVING SATISFAC TORY 
AND UNSATISFACTORY GRADES ACCORDING TO I Q CLASSIFICATION 


AT THE FOURTH AND SIXTH GRADE LEVELS = 

[s] 

9 

I.Q. E. 

Grade . 10-890 ^ 90-100 110-120 ^ 130: Total S 

No. Per- o. Per- o. Per- o. Per- o. Per- ғ 

Cent Cent Cent Cent Cent g 

4th Grade із 

Boys - Satisfactory 8 22.8 56 47.4 91- 1121 59 83.1 214 60.8 E 

Unsatisfactory 2T NA 62 52.5 37 28.9 12 16.9 138 39.2 g 

5 

Girls - Satisfactory 6 33.3 69 64.4 111 87.4 60 98.4 246 78.6 2 

Unsatisfactory 12 66.7 38 35.5 16 12.6 T 1.6 67 21.4 н 

т 

E 

[^] 

6th Grade a 

Boys - Satisfactory 14 36.8 60 56.1 69 61.6 33 76.7 176 58.7 B 

Unsatisfactory 24 63.2 47 43.9 43 38.4 10 23.3 124 41.3 a 
Girls - Satisfactory 6 37.5 75 68.8 108 81.8 31 93.9 220 75.9 
Unsatisfactory 10 62.5 34 31.2 24 18.2 2 6.1 "0 24.1 
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tary schools of a suburban Long Island school dis- 
trict. There were 352 boys and 313 girls in twenty- 
nine classes at the fourth grade level, and 300 boys 
and 290 girls in twenty-one classes at the sixth grade 
level. The basis for class grouping was geographic 
locale. Anintelligence test was administered in or- 
der to ascertain the I. Q.'s of the students. The re- 
port card of each student was analyzed to determi ne 
whether the pupil had received satisfactory or un- 
satisfactory grades. The numbers of students re- 
ceiving satisfactory and unsatisfactory grades at 
different I. Q. levels were tabulated at both fourth 
and sixth grade levels. Separate tabulations were 
made for boys and girls because of the consistent 
research findings in favor of girls (8). The criterion 
forasatisfactory student was possession of a report 
card which did not contain a single unsatisfactory 
mark. An unsatisfactory student was one who ob- 
tained a reportcardcontaining one or more unsatis- 
factory marks. Bi-serialcoefficients of correlation 
between I.Q. and satisfactory andunsatisfac tory 
grades were determined separately for boys and girls 
atboth the fourth and sixth grade levels. Product- 
moment coefficientsof correlation were also deter- 
mined between the average I. Q’s of the classes and 
the fractions (percentages) of satisfactory students 
in each class at both gradelevels and for both sexes. 


The Findings 


The data concerning the numbers of satisfactory 
and unsatisfactory students at different I. Q. levels 
is presented in Table I. From the data in Table I it 
is readily apparent that for both sexes and at both 
grade levels as the I. Q. rises the number of unsat- 
isfactory students decreases. There is obviouslya 
positive correlation between school success (satis- 
factory grades) and I. Q. 

The bi-serial correlations between I. Q. and sat- 
isfactory and unsatisfactory grades at the fourth 
grade level were .53 for the boys апа. 59 for the 
girls. At the sixth gradelevel the correlations were 

‚ 30 for the boys and .37 for the girls. All four of 
these correlations were significant at the . 01 level 
of confidence. 

The product-moment correlations between the 
average І. Q. "s of the boys in each fourth grade class 
and the fractions (percentages) of boys in each class 
receiving satisfactory grades was -. 37. For the girls, 
it was -.29. Atthe sixthgrade level the correlations 
for the boys and girls were. 32 апа. 35 respectively. 
The only one of these correlations which is signifi- 
Cant is the one for fourth grade boys. It is significant 
at the . 05 level. 


Discussion 


A perusal of the numbers and percentages of boys 
and girlsreceiving satisfactory and unsatisfactory 
grades according to І. Q. classification at the fourth 
and sixth grade levels manifested an obvious posi- 
tive correlation between grades and I.Q. for both 
sexes at both grade levels. A determination of bi- 
serial coefficients of correlation indicated that the 
obvious correlations were all significant at the . 01 
level of confidence. Product-moment coefficients of 
correlation between the average I.Q's and the av- 
erage passing rates inthe classes resulted, however, 
in negative correlations at the fourth grade level and 
positive correlations at the sixth grade level. 

The correlating of averages at the fourth grade 
level resulted in coefficients which were in the op- 
posite direction of whatthey should have been. Some 
very lucid explanations of this phenomenon are avail- 
able (3,4). Thefindingsof this study, however, tend 
to corroborate the hypothesis that correlation with 
averages is particularly questionable in the area of 
teacher grading. 
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2. The nature of teaching competence. How do different types of teachers affect 
the learning of the students? How may good teachers be developed? 

3. Can elementary school children learn elements of the basic structure of high- 
er mathematics ? 

4. What is the effect of size of school organization on student learning? 

5. How can mentally retarded children be identified? 

6. How may the rate of learning of the blind be increased? 

7. What are the causes of juvenile delinquency ? 

8. What is the impact of home and school environment upon adolescent motiva- 
tion to learn? 

9. What are reliable standards of the identification and development of academic 
abilities? 
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THE DRAKE TEST OF MUSICAL APTITUDE was found to have predictive validity for 
achievement in music of fifth- and sixth-grade students in Colorado Springs 
School District No. 11. A follow-up study with seventy-two sixth-graders re- 
vealed high correlation between musical aptitude and subsequent achievement 
scores. 
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THE INSTITUTE OF COMMUNICATIONS RESEARCH of the University of Illinois is con- 
ducting a study of certain developments in mass communications and mass cul- 
ture in the United States and selected western European and eastern European 
countries. The three major objectives of the current phase of the study are: 


1. To assess the portrayal of teachers, teaching, and schools in massed pro- 
duced entertainment andinformationthroughanalysis of mass media content. 

2. To SOSTA processes of decision making and control affecting such por- 
trayal. 


3. To appraise the nature and effects of contacts, if any, between the mass 
media and the educational professions. 
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Readers are invited to send items of research interest to 
John Schmid, Professor, College of Education, University 
of Arkansas, Fayetteville, Arkansas. 
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Student Values and Grades in General Psychology 


IN A RECENT article, the writer presented a 
technique of assessing student values by means of 
the Semantic Differential (5). The results of this 
investigation showedthat, for the small group stud- 
ied, grades received in a freshman psychology 
course were related to similarity in values between 
Student and instructor and also to the student's a- 
bility to predict his instructor's values correctly. 
Inthe same article, the writer pointed out that the 
Scoring of the Semantic Differential to obtain a mea- 
sure of the value dimension was quite laborious. The 
purpose of the present paper is to examine the re- 
sults obtained from a short value-ranking method 
used to study the same general hypotheses, in order 
to ascertain whether or not this more convenient 
technique is sufficiently valid to permit its substitu- 
tion for the Semantic Differential. 


Method The subjects of this study were 42 non- 
freshman college students (24 men, 18 women) en- 
rolled in a course in general psychology taught by 
the same instructor as in the previous study. It 
Should be pointed out that the characteristics of the 
Students in the two studies are probably quite dif- 
ferent, in that the present group consists of sopho- 
mores and upper classmen taking a more difficult 
course from an instructor whose reputation for 
Scientific rigor was probably known to some of the 
Students before they enrolled in his classes. The 
Original group studied was composed of freshman 
males taking the first elementary course taught by 
the instructor atthis college. і 
Each student during the semester took the Otis 
Quick Scoring Mental Ability Test and the Gough E- 
F Scale (3), which measures anti-democratic and 
ethnocentric values and which Gough found to be cor- 
related with grades. At a later date, each student 
Was presented with alistofthe same fifteen concepts 
used in the original study? and asked to rank them 
intermsof how valuable, good, and worthwhile each 
was to him personally When he finished, the student 
was then asked to predict the rankings actually given 
these concepts by his professor, who had previously 
ranked them. The student’s self-ranking for each 
Concept was subtracted from the professor’s self- 
ranking and the individual discrepancies summed to 
give an overall **Student-Professor"' value discrep- 
ancy score. Next the discrepancies between ће pro- 
fessor’s self-rankings and the student’s prediction 
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of those rankings were summed to obtain a measure 
of the student's ability to predict his professor's 
values (**Professor-Predicted" score). These two 
Scores and the E-F Scale and Otis IQ scores were 
then correlated with the total number of points re- 
ceived onall objective examinations given during the 
semester, which wasthecriterion of grade achieve- 
ment. 

Before analyzing the results, it was decided to 
omit from consideration the five students with the 
highest Otis IQandthe five with the lowest IQ. This 
was done on the basis of a recent articleby 
Spielberger and Katzenmeyer, who showed that a 
test of personality, such as the Taylor MAS, cor- 
relates with grades for students of average intellec- 
tual ability, but not for students outside the middle 
range (4). Their reasoning applied to this study would 
be that a student of low intellectual ability would not 
be able to obtain good grades despite his values, 
whereas a bright student could obtain such grades 
even withdiscrepant values. Onlythese ten extreme 
cases were omitted, since discarding more would 
have drastacally lowered the group size. The final 
group consisted of 20 men and 12 women, with a 
median age of 20. 5 апа a median IQ of 117. 9. 

Based on previous studies (1,3,5), four major 
predictions were made that grades in this course 
would be positively correlated with: 

a. Low Student- Professor discrepancy score 

b. Low Professor-Predicted discrepancy scores 

c. Low score on Gough's E-F Scale 

d. High Otis IQ score 


Results and Discussion The rank-order coefficients 
obtained by inter-correlating each of the above var- 
iables are listed in Table I. Since the sampling in 
this study is far from random, it is probably not 
statistically correct to evaluate the results by the 
usual tests of significance. 


It will beseen from Table I that three of our four 
predictions are confirmed." Grade achievement is 
related to the degree of similarity between the values 
of the student and his instructor, to the intensity of 
the student's democratic, unprejudiced, and broad- 
minded values, and to his intelligence. The pattern 
of inter-correlations among these three predictor 
variables indicated a positive relationship between 
intelligence and democratic values onthe Е-Е Scale, 
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a finding which also was obtained by Gough (3). The 
E-F Scale and Student-Professor discrepancy are 
moderately correlated, indicating that to some ex- 
tent having values similar to the instructor's means 
having democratic, nonethnocentric values. The IQ 
Score is not highly correlated with Student-Professor 
discrepancy, and these appear to be disparate fac- 
tors, although each correlates with grades, and to 
some extent with Е-Е Scores. In the previous study 
(5) the Student-Professor discrepancy score (based 
on the Semantic Differential) and the ACE-Linguistic 
Score both correlated significantly with grades but 
not with each other. In any case, these three vari- 
ables (IQ, E-F, andStudent-Professor discrepancy) 
all correlate with achievement, although the cor- 
relations with each other are complex. 

This group of subjects is too small toevaluate 
the relationships between these variables and grades 
for students at each end of the IQ distribution. For 
the ten extreme IQ students taken as a group, how- 
ever, the correlations between grades and Student- 
Professor discrepancy and E-F score were quite 
low, whereas the correlation between grades and IQ 
was .59. This problem is of sufficient interest to 
warrant further study with a larger sample. 

Our fourth prediction, that grades would be cor- 
related with the student's ability to predictthe pro- 
fessor’s value rankings was not confirmed. There 
Seems to be no ready explanation for the fact that 
this score correlates significantly with grades when 
it is determined from the Semantic Differential and 
not when it is determined from the rankings of con- 
cepts. In both studies, the Professor-Predicted 
Score correlated with a measure of intellectual abil- 
ity. 

As far as the major purpose of this study is con- 
cerned, it seems evident that our attempt to substi- 
tute a quick ranking procedure for the unwieldy 
Semantic Differential was only partially successful. 
The total difference in ranks assigned the fifteen 
concepts by instructor and student was correlated 
With grades, and for this purpose the ranking tech- 
nique appears adequate. However, the attempt to 
Obtain a valid Professor-Predicted score from this 
technique failed. Further studies should be done to 
determine if an adjective checklist procedure not 
involving ranks (and perhaps using different con- 
cepts) might generate adequate correlations between 
grades and ability to predict instructor values. In 
another context, Block found that an adjective check- 
list was simpler to administer and score and pro- 
vided more reliable information than did the Seman- 
tic Differential (2). 


Summary A group of 42 students enrolled in two 
general psychology classes was given the Otis In- 
telligence Test, the Gough E-F Scale, andtwo spe- 
cial procedures. The first involved ranking fifteen 
concepts as to their value for the individual making 
the ranking. The second involved re-ranking these 
concepts as the student thought his instructor had 
ranked them. The five students with the highest and 


the five with the lowest IQs were omitted from the 
sample. 

The results indicated that grades in psychology 
for the middle three-fourths of the IQ distribution 
were correlated with intelligence, the degree of sim- 
ilarity in values between a student and his teacher 
and withthe degree of democratic, non-ethnocentric 
values the student shows. The inter-correlations 
among the predictor variables were complex. 


FOOTNOTES 


1. The author wishes to express his appreciation to 
Professor James McGaugh for his time and effort 
in making this project possible. 

2. These concepts were: play, professor, college, 
cheating, good time, easy money, work, stud- 
ying, homework, books, grades, athletics, tests, 
Social activities and research. The instructor's 
value scores for these variables, based on his 
Semantic Differential, when converted to ranks, 
correlated . 75 with his ranking as described in 
the body of this article, which was done one y ear 
later. This correlation was lowered by his con- 
fusion over whether the single concept **profes- 
вог”” should be ranked in terms of his own values 
or from the student's point of view. 

3. Applying the usual test of significance yields a p 
value of<.01 (1 tail) for each of the confirmed 
prediction and<. 01 (2 tail) for the E-F Score-Otis 
correlation. 

4. For the complete group of 42 subjects the corre- 
lation between Student-Professor discrepancy and 
grades was -.26 (p<. 05, 1 tail). 
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Editor's Note: The above list is unusally long since this item has 
not appeared in the Journal for some time. The inclusion of a book 


in this list does not pr ec lude nor suggest that the book will be re- 
viewed in a subsequent issue. 
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Difficulties in Developing Skill in Critical Thinking 


MUCH HAS BEEN written concerning the import- 
ance of developing skill in criticalthinking. Appar- 
ently everyone agrees that teachers should devote a 
considerable amount of time to developing such skill. 
Unfortunately, however, many investigators have 
found that the schools are not developing skill in 
Critical thinking as they should. It seems possible 
that one of the major reasons why some of the pro- 
cedures designed to develop skill in critical thinking 
are not being used by many high school teachers is 
that they are extremely difficult to use on the sec- 
ondary-school level. Inorder to determine whether 
this assumption was correct, the author devised a 
questionnaire containing the procedures for devel- 
oping skill in critical thinking which were recom- 
mended by many authorities. The specific purposes 
of the questionnaire were to determine (1) which 
procedures teachers consider to be the most difficult 
to use, (2) why teachers believe that certain pro- 
cedures are the most difficult touse, and (3) whether 
there are any significant differences in the difficul- 
ties reported by various groups of teachers. 

Because one of the principal purposes of the 
course in Problems of Democracy is to develop skill 
in critical thinking, it was believed desirable to de- 
limit the study to teachers of the problems course. 
The questionnaire was completed by 143 teachers of 
the problems course in California secondary schools. 

The procedures for developing skill in critical 
thinking are ranked in Table I according to the de- 
Sree of difficulty of using the procedures reported 
by the teachers. The rank order scale was deter- 
mined by ranking the responses of the teachers ac- 
cording tothe following weighted values assigned to 
the four possible responses: three points for the 
“MostDifficult” response, two points for (һе“Мой - 
erately Difficult’? response, and one point for the 
“Least Difficult’? response. 

An examination of Table I reveals an important 
difference between the procedures which were re- 
ported to be the most difficult to use and the pro- 
Cedures which were reported tobe the least difficult 
to use. The procedures which were reported to be 
the most difficult to use require higher levels of 
Critical thinking on the part of the students-- to an- 
alyze, interpret, and evaluate information; to 
evaluate sources of information; and to de- 
termine the most reasonable and logical 
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conclusions. The procedures which were reported 
to be the least difficult to use are the procedures 
for gatheringinformation, and most of them require 
only a passive type of thinking of learning on the part 
of the students--to gain informationfrom audio-vis - 
ual aids, to gaininformation by listening to others. 
This does not mean that such activities are without 
value, onthe contrary, they are essential procedures 
for developing skill in critical thinking and for solv- 
ing problems. However, they are only preliminary 
Steps in critical thinking, not ends in themselves. 

The reasons given by the teachers who indicated 
that they believe ** Teaching studentsto analyze, in- 
terpret, and evaluate information" is the most dif- 
ficult procedure or stepindeveloping skill in crit- 
ical thinking are categorized in Table II. 

The fact that 9. 3percentoftheteachers indicat- 
edthat they do not have sufficient time to teach stu- 
dentstoanalyze, interpret, and evaluate information 


is GLO If almost one out of every ten teach- 
ers of theproblems course, which has as one of its 


principal purposes the development of skill in crit- 
ical thinking, feels that he does nothave time to 
teach students to analyze, interpret, and evaluate 
information, what percent ofteachers in other sub- 
ject matter areas feel that they do not have time to 
develop skill in critical thinking? 

Over 28 percent of the teachers indicated that 
lethargy and lack of interest on the part of the stu- 
dents make it difficultto teach them to analyze, in- 
terpret, and evaluate information. Although students 
are not vitally interested in analyzing, interpreting, 
and evaluating information concerning all problems, 
the careful selection of problems in terms of the 
needs, interests, and abilities of the students should 
do much to eliminate lack of interest and lethargy. 

In orderto ascertain whether the difficulties re- 
ported by the teachers are attributable to facto rs 
over which the pre-service and in-service teacher 
education programs can exercise control, the dif- 
ficulties reported by various groups of teachers were 
compared. 

Most of thedifferences which were found in com- 
paring the difficulties reported by various groups 
of teachers were expected and are attributable to 
such factorsas differences in education, experience, 
class loads, size of school, and location of school. 
However, onedifferenceis quite surprising. Teach- 
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TABLEI 


PROCEDURES FOR DEVELOPING SKILL IN CRITICAL THINKING RANKED 
ACCORDING TO THE DEGREE OF DIFFICULTY 
OF USING THEM REPORTED BY TEACHERS 


Rank Rank Order 
Order Procedure Scale 

1 Teaching students to analyze, interpret, and eval- 
uate information 267 

2 Teaching students to evaluate sources of inform- 
ation 250 

3 Teaching students to determine the most reason- 
able and logical conclusions 240 
4 Motivating students 235 

5.5 Getting students to recall known information and 
К to determine the need for more information 233 

5.5 Having students evaluatetheir decisions, proces- 
Ses, and outcomes 233 

gb Teaching students to organize and summarize іп- 
formation 232 

7.5 Providing opportunities for the students to act on 
their conclusions 232 
9,5 Having students set up tentative solutions 218 

9.5 Getting students to secure information by reading 
and using library resources 218 

11 Utilizing group planning to determine what infor- 

mation is needed, how to secure that informa- 
tion, and what procedures to use 212 

12 Getting students to accept delegated responsibil- 
ities 206 

13 Getting students to secure information by inter- 
viewing 197 

14.5 Selecting problems in terms of the needs, inter- 
ests, and abilities of the students 192 

14.5 Selecting problems in terms of the resources, 

needs, and demands of the school and of the 
community 192 

16 Getting students to secure information by listen- 
ing to others 190 

17 Recognizing and clarifying problems through dis- 
cussion of questions raised by the students 165 

18.5 Getting studentsto secure information from field 
trips 146 

18.5 Getting students to secure informationfrom audio- 
visual aids 146 
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TABLE П 


REASONS GIVEN FOR DIFFICULTIES IN TEACHING STUDENTS TO ANALYZE,INTERPRET, 
AND EVALUATE INFORMATION AND THE PERCENT OF 
TEACHERS WHO GAVE EACH REASON 


Reason 


о 
o 
5 
- 


Lack of previous training ог experience in analyzing, interpreting, 


and evaluating information 
Lack of interest 
Accept the printed word as gospel 
Lethargy 
Immaturity 
Limited or insufficient time 
Emotionalism 
Lack of materials 
Poor reading ability 
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erswiththe most experience reported that it is sig- 
nificantly more difficult to teach students to deter- 
mine the most reasonable andlogical conclusions 
than teachers with theleastexperiencedid. The dif- 
ference may be attributable to the fact that teacher- 


education institutions have given more attention in 
recent years to the problem method of teaching and 
to the need for developing skill in critical thinking. 
If so, the implications for both pre-service and in- 


service programs are great. 
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A Standardized Manuscript Scale for Grades 1, 2, and 3 


THE CLASSIFICA TION of research in handwrit- 
ing generally falls into one of these categories: (1) 
the general nature of handwriting programs, (2) 
handwriting and its effect on other school subjects, 
(3) handwriting styles and materials, (4) teaching 
techniques, and (5) the measurement of handwriting. 
Of these, the measurement of handwriting has al- 
Ways been of prime concernto the elementary 
School teacher. 


Philosophy of Measurements 


Educational measurements establisha part of the 
foundation of our educational philosophy. Without 
measurements, which give qualitative and quantita- 
tive accuracy, one cannotascertain the progress of 
а student's learning. 

Among the measurable learnings of children is 
the ability to make written symbols which, when 
put together, comprise one basic form of communi- 
cation, Students of handwriting early realized that 
the most valid and reliable criteria for measuring 
and evaluating children's handwr iting were hand- 
Writing scales. As early as 1916, Johnson said “А 
Scale gives the basis for a common unde rstanding 
and accuracy in judgment. ''* Since then, numerous 
handwriting scales have been constructed. 


Need for a New Scale 


Out of the score of handwriting scales evaluated, 
Some were outstanding in certain respects. Others 
Were grossly inadequate. It is through these eval= 
ов and criticisms that one constructs a better 

is 

The shortcomings of the scales which propose to 
measure manuscript make itevident thata new 
Scale is needed. The new scale should meet sever- 
al requirements. In certain instances, these re- 
quirements are the same as those-setforth by Free- 
Шап, which were adhered to in the constructio 

е Evaluation Scale. In other instances, the re- 
duirements go beyond Freeman’s specifications. 


— O КИЫНЫ 
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* Johnson, George L., “Measuring the Quality ona 


1916), p. 302. 
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The Requirements 


1. The new scale should cons ist of a series of 
scales, one for each grade, that measures at least 
five degrees of quality at each grade level. 

2. The new scale should bea measure of rate as 
well as quality. 

3. The general criteria for the selection of 
specimens should be legibility: color, size of let- 
ters, slant, letter spacing, beginning and ending 
strokes, word spacing, alignment, andletter forms. 

4. Specimens for the new scale should be selec- 
ted from a large sampling and the samples should 
be representative of children's handwriting through- 


out the United States. 
5. Specimens for the scale should be selected 


through a recognized statistical technique that al- 
lows itself to render validity and reliability to the 


method of standardization. 
6. The results of the measurement should lend 


itself to the present day methods of reporting pupil 
progress. 

7. The new scale should show overlap in quality 
among the grades. 

8. The new scale should be cross-validated 
with a cursive scale. 


Plan of Action 


Sampling.. Random stratified sampling was em- 
ployed to select 130 schools throughout the United 
States. Letters were sent to the superintendent of 
the selected schools, asking their cooperation in 
securing manuscript handwriting samples. Direc- 
tions for giving the test were sent to the cooperat- 
ing Schools, and these tests were administered dur- 
ing the month of April, 1956. 

Scoring for Rate. Those samples were discard- 
ed in which directions had evidently notbeen follow- 
ed. The three tentative sets of rate-norm tables, 
one for each grade, were based upon the letter 
count in terms of letters per minute of the 7,212 
samples. The rate norms are expressed in raw 


ndwriting", Elementary School Journal, XVI (February, 
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Scores, percentiles and T-scores. 

Scaling for Quality. Fifty samples from each 
grade were selected by a systematic plan to assure 
normality in their distribution. The fifty samples 
in each of these three sets were identified by num- 
ber and handed to judges for ranking with full in- 
Structions as to procedure. The judges consisted 
of teachers from the particular grade level from 
which the samples came. Ineach grade, those 
samples that received a designated meanrank were 
assigned a level of quality and appear as the scales 
of quality. 

The three tentative quality rate-norm tables,one 
for each grade, were based uponteachers' opinions 
as to what elements constitute the five quality levels 
of manuscript writing. The quality norms are ex- 
pressed in raw scores, percentiles and T-scores. 


Overlap in Quality. The overlap in quality of 
writing by the grades is very striking, but natural, 
There is a four-step overlap in the quality of writ- 
ing between Grades 1 and 2, and 2 and 3. 

Cross Validity. The proposed manuscript scales 
were cross-validated with a cursive scale. Eighty- 
four percent of the teachers in Grades 1, 2 and 3 
noticed that the manuscript writing was one quality 
level better than the cursive writing. 


E 
Conclusion 


These standardized scales afford a means where- 
by the teacher, the pupil and the administrator may 
evaluate with a high degree of accuracy both rate 
and quality of manuscript handwriting. 


"—— 
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Analysis of Mental Processes as a Preliminary 


Stage in Test Validation 


ACHIEVEMENT and psychological test construc- 
tion have been somewhat handicapped by the empha- 
sis on objectivity. It would be rather interesting to 
hypothesize why the workers in the field of mental 
testing have felt it so necessary tolimittheir meth- 
ods of investigation so inflexibly. One explanation 
might be that in our attempt to be accepted by the 
prestige disciplines of physics, chemistry, sociol- 
ogy, psychology, their methods were utilized. An- 
other explanation might be that the objective tech- 
niques were more readily recognized as being im- 
mediately useful. Whatever the explanation, the 
improvement of mental testing has been hindered by 
this emphasis. 

As is pointed out in the Review of Educational 
Research: ‘‘Most constructors and many users of 
educational achievement tests are aware of the fun- 
damental problems involved in the creation of valid 
tests and the provision of evidence on test validity. 
Hence, it is somewhat surprising and disappointing 
that there have been so few clarifying discussions 
of the complex problems associated withtest valid- 
йу.” (3) 

Presently testing procedures are still largely 
limited to statistical investigations or to operational 
definitions of the type of behavior to be measured. 
This author would like to suggesta different avenue 
of approach for a preliminary step inthe determin- 
ation of validity than is commonly explored. This 
approach was explored and reported upon in detail 
in a dissertation submittedto the Department of Ed- 
ucation, University of Chicago (2). 

This approach involved analyzing the mental рго- 
cesses of the subjects while taking an examination. 
This is, of course, a move in the direction of a 
method which might be considered sub jective, in- 
trospective, non-scientific; however, itp roved 
fruitful. The basic notion back of this approach was 
that we can never be certain that a test situation is 
actually appropriate to aparticular definition of be- 
havior until the actual behavior of the subject, when 
confronted with the test situation, is obtained and 
analyzed. To do this most directly involved obser- 
vations and analysis of the thought processes re- 
quired in answering the examination items. 


LOUISE L. TYLER 
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This preliminary stage of test validation was ex- 
plored in the development of achievement tests for 
advanced students in education. Briefly, the gener- 
al steps followed in the test construction were: 

1. Securing of explicit definitions of the behav- 

ior to be measured. 

2. Developing test situations which presumably 
were appropriate. 

3. Presentingthetest situations to the subjects 
who reported the mental processes used in 
answering items. 

4. Using expert opinion to determine whether 
the behavior evoked by the test situation and 
reported did reflect the defined behavior 
which the situation was to have measured. 


Securing of Explicit Definitions 


The examination as constructed included seven 
sections corresponding to the seven elements iden- 
tified by the analysis of content; that is, there was 
a section on the Individual Learner, Society, Philo- 
sophy, Administration, Curriculum, Evaluation and 
Personnel Services. Also, within each of these 
seven sections, the test items involved (һе behavior 
of understanding—that is, stating, recognizing, re- 
calling; that of abilities—that is, being able to for- 
mulate a theory, being able to assess a theory, etc. 
(6). 
The desirability of these definitions of the behav- 
ior to be measured was checked for significance by 
the Deans of Education of the Big Ten Universities. 
In addition, a sample list of concepts as part of the 
content analysis was also checked for significance. 
Minor revisions only were suggested. Therefore, 
it was believed that the rationale formulated for the 
identification of content and behavior was sound. 


Developing Test Situations 


The procedures utilized for the development of 
test exercises followed the general pattern of se- 
lecting test situations which, on the basis of previ- 
ous experiences and face validity, required the use 
of the specified content and behavior. 
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Presenting the Test Situations to the Subjects 


As the validity of an item required determining 
whether the item measures what it purports to mea- 
sure, the following conditions then basically should 
be fulfilled in the case of each item: 

1. Any subject who obtains an adequate score 
properly carries on the type of mental activity 
required by the item, that is, he uses the re- 
quired content and behavior. 

2. Any student who obtains an inadequate score 
does not properly carry on the type of mental 
activity required by the item, thatis, he does 
not use the required content and behavior. 

In order to determine whether these conditions 
were fulfilled, the following information had to be 
ascertained for each item. Тһе first kind of infor- 
mation necessary was a statement as to what objec- 
tive the item was supposed to test for. The second 
kind of information needed was adescription of what 
the student actually did in tryingto dothe test item. 

The third kind of information needed was a judgment 
as to whether the student possessed the requisite 
knowledge when a test item required both knowledge 
and application or use of thatknowledge. Thefourth 
type of information needed was a judgment as tothe 
Score or grade made on the items. 

The tabulation sheets used were sét up as shown 
at the bottom of this page. 


АП of these judgments were made independently 
by the author and the Chairman of the Department 
of Education, University of Chicago, by analyzing 
the transcriptions of the student's reported thoughts. 
For each test item, these four judgments had to be 
similar. Of the 479 responses obtained from the 
Seven sets of examinations which were analyzed by 


(Name of Judge FNUPERFUQ 


the judges, 459 of them met the conditions for valid- 
ity just described. That is, 96% of the responses. 

While it is clear that this technique of analyzing 
mental proces: 26 as a preliminary stage in test val- 
idation is time consuming and expensive, it is an 
important and essential step. 
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PRACTICE TEACHING can play a major roleinthe development of a student teacher's 
attitudes toward children, and the change may not be for the better, Dutton be- 
lieves after discovering that anxiety level—a studentteacher's self-confidence 
or lack of it—can effect her attitude toward pupils in surprising ways. Ina 
study of 91 elementary school student teachers at UCLA, 80 percent of those 
judged to be **highly anxious’’ changed their attitudes toward children іп a neg- 
ative direction; but so did 80 percent of the practice teachers rated most 
*mnon-anxious."' Meanwhile, education students not engaged in practice teach- 
ing maintained their high positive attitudes toward children. “Тһе study sug- 
gests practice teaching situations may be unduly stressful, ’’ Dutton says. 
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Assuming а common denominator of minimum education and 
i training, the big difference between teachers as measured by pupil 
1% achievement is А DEEP-SEATED INTERPLAY OF TEACHER- | 
CHILD PERSONALITY CHARACTERISTICS. Sostate Heil and Wash- 
burne in an interim report on their Brooklyn continuing study into 
teacher effectiveness. Тһе researchers have been able to identify 
a type of teacher whose students typically show Superior gains. Her 
outstanding personality characteristics add up to “а1еадегвһір role 
coupled with work orientation. ° Conventional teacher observation 
ratings and teacher education exams do not distinguish this type of 
teacher from her less effective colleagues, the authors say. 
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VARIOUS RESEARCHERS have reported a correlation between veteranteacher scores | 
on the Minnesota Teacher Attitude Inventory (MTAI) andthe Guilford- Zimmerman 
Temperament Scale (GZTS). То see if the same thing holds for student | 
teachers and beginning teachers, Cook analyzed appropriate data and found 
that while beginning and student teachers show similar relationships be- 
tween MTAI scores and each of the various GZTS traits, these correlations are 
nowhere near so marked as in the case of experienced teachers. His findings 
suggest (a) that actual personality changes may take place with increased par- 
ticipation in professional work, and (b) that neither the MTAI nor the GZTS 
сап be employed successfully as an initial selection device because they do not. 
discriminate sharply enough at the prospective-teacher level. 
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KINDERGARTEN EXPERIENCE helps first-graders achieve average or better Social 
marks, but it doesn't make any difference whether the kindergarten was in the 
United States or in Europe, Drasgow and Stockin report. At least at the pre- 
school level, there is little difference between American and European educa- 
tional influences, the researchers say. 
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А rash of commissions, boards, studies, and reports has 
grown up in recent years around the r evolution in the teaching of 
mathematics. Smith presents a summary of the literature which 
can be of assistance to the high-school math teacher. 
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WHICH IS MORE EFFECTIVE IN COLLEGE TEACHING, the mass lecture-quizz sec- 
tion combination or the seminar-type class? In terms of teaching the funda- 
mentals of speech to students at the University of Kansas, there is no statisti- 
cally significant difference between the two methods, Giffin and Bowers report. 
Both approaches produced positive changes in knowledge. “ІК appears that the 
mass lecture method of teaching can be employed without diminishing the 
amount of learning," the researchers conclude. 
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WHICH IS MORE EFFECTIVE IN COLLEGE TEACHING, regular lecture-discussion 
a textbook, or periodic meetings built around extensive 


meetings built around 

“outside reading" ? In terms of teaching general psychology to students at the 
University of Nevada, the conventional method is better, an exploration by 
McQueen suggests. While superior students did wellunder both systems, av- 
erage students demonstrated a considerably greater proficiency as their in- 


H 22 
structor contacts increased. BA xx 


If it is important that teachers “ее things from the students’ 
point of view," how accurately can teachers, in fact, predict high- 
school pupils’ feelings on a variety of particular issues? Not too 
well, according to a Pennsylvania study by Frymier. His teachers 
correctly estimated adolescent response to ten items, but they 
missed the boat on ten other items. The experiencedteachers did no 
better than a control group of college students. 
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to teach at the elementary level have differ ent 
teachers aiming at the secondary level? Def- 
Psychologists have long agreed that conscious 
ence occupational choice. Now Scandrette re- 
between elementary and secondary student teach- 


DO WOMEN STUDENTS, preparing 
**need patterns” from student 
initely yes, says Scandrette. 
or sub-conscious needs influ 
ports significant differences 


Continued on page 35: 
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Brooklyn College Research in Teacher Effectiveness 


LOUIS M. HEIL and CARLETON WASHBURNE 


Brooklyn College 


THE JOURNAL OF EDUCATIONAL RESEARCH 
(Volume 55, Number 8, May 1962) 


During the past eight years the Office of testing 
and Research at Brooklyn Collegehas been at work 


AERA, C. Washburne presented a Design for Long 
in this field, concentrating on the first three of the 


Range Research in Teacher Education. 1ж Несоп- 


EIGHT YEARS AGO, atthe Boston meeting ofthe к 
1 


cluded that report with the following paragraphs: 


We want to measure the growth that takes 
place under the most ейеспуе teachers in 
contrast with that which takes place, or 
fails to take place, under the least effec- 
tive teachers, in a variety of ways and in 
children of widely varying bac k grounds. 
We want to discover what patterns of teach- 
er behavior and understanding distinguish 
the more effective teachers from the less 
effective ones. We want to know what types 
of persons entering our colleges are like- 
ly to achieve these characteristics if given 
the most effective learning experiences we 
can devise. We want todiscover what these 
most effective learning experiences аге SO 
that we сап evaluate and improve our pro- 
grams of teacher education. We want to 
test our products before they actually take 
responsibility for educating children and 
youth. And we want to submit the whole 
program to the ultimate test of whether or 
not teachers whohave been through it ac- 
tually help the boys and girls entrusted to 
them to grow effectively in the achievement 
of the values which we agree are necessary 
for the fulfillment of their potentialities 
as persons and as contributing and re- 
sponsible members of society. 

The jobis big enough to engage the en- 
ergies of all of us over many years. Each 
step of the way is in itself worthwhile. 
The goal is the goal of education. 


SOE OS M Uu dues o с^ 
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Footnotes will be found at the end of the article. 


above questions as basic to the remaining ones. 

The problem chosenfor investigation has been 
that of identifying different types of children a nd 
finding ways of measuring their growth under dif- 
ferent kinds of teachers. It has therefore been ne- 
cessary to find or develop instruments for classi- 
fying different categories of children and different 
types of teachers, and then measuring the kinds and 
amounts of growth that take place inthe children of 
each category under teachers of each type. Our 
hypothesis is that certain changes do take place in 
children under the influence of teachers and that 
there is a relationship between these changes and 
the type of children and teachers. 


Classifying Children 


For categorizing children and measuring their 
growth the following instruments were used: 


a. The Otis Quick Scoring Intelligence Test, 
Elementary Form 

b. The Stanford Achievement Test, Elementa- 
ry for grade 4 and intermediate for grades 
5and 6 (omitting the section on study skills) 

c. The Ohio Social Acceptance Scale 

d. The Brooklyn College Test of Children’ s 
Feelings 


The last-named measure, since it had not been 
published as yet, needs a brief explanation. After 


- several years of study and experimentation, a se - 


ries of short dramatic episodes were prepared 
(with the help of Professor Paul Williams of the 
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Brooklyn College Department of Speech and Thea- 
tre), designed to call forth projective responses from 
children who hear them оп a tape recording. T he 
tape indicates exactly how the children are to indi- 
cate their responses so that they can be machine 
Scored. The tapes are played to an entire class. 
Statistical analyses of various trial runs, before the 
test had been used in the actual experimentation, 
Showed that there are identifiable clusters of traits 
which usually are found together in any onechild, 
and that most children can be rather clearly classi- 
fied under one of four categories. 


Classifying Teachers 


The following instruments were used for identi- 
fying different types of teachers: 


а. The Teacher Education Examination Pro- 
gram (Educational Testing Service) 

b. The Manifold InterestSchedule (Heil-Shevia- 
kov-Stone). Thisinstrument gives an index 
of Academic Interests and Personality. 

c. The Brooklyn College Teacher Observation 
Forms. Thesehave beendeveloped with de - 
tailed instructions as to kinds of behavior to 
be observed in teachers and method of rat- 
ing each kind. 

d. The Brooklyn College Interaction with Chil- 
dren Test. This is in some ways analogous 
to the Brooklyn College Test of Children's 
Feelings (d above), in that it uses tape re- 
corded dramatic episodes to which the re- 
Spondent gives reactions partly projective in 
‘character. Unlike the test for children, how 
ever, the responses are open-ended and аге 

recorded on tape with an 8-second time in- 
terval for each response. 


The Current Experiment 


Having decided upon or developed the above- 
instruments, further experimentation was conduct- 
ed.3 The child population consisted of 55 classes 
of children. One third are indistricts oflow socio- 
economic status; one third in districts of interme- 
diate status; and one third in districts of relatively 
high socio-economic status.4 The children were 
in grades 4,5and6. We wantedto measure changes 
in children as related to the influence of teachers; 
and only in the grades below juniorhigh schoolis it 
possible to get children who areunder the influence 
of one teacher all day and usually all year. We 
avoided the primary grades so that we would have 
a population that could read and write adequately. 
We used Brooklyn because it was a widely diversi- 

fied city of nearly three million and conveniently 
available. 

The teacher population consisted of two teachers 
of each of the three grades in nine large public 


Schools. Theprincipal іп each school was instructed 
to select, foreach grade, one teacher whom he con- 
sidered to be pretty strong and one whom he con- 
Sidered not so good. Neither was identified. The 
results show that range of mean pupil achievement 
was from 0.3 to 1.85 years in a 7 month interval. 

The children weregiventhe Stanford Achievement 
Test and the Ohio Social Acceptance Scale near the 
beginning and again (with different forms) near the 
end of the academic year-about seven months apart. 
They were giventhe Otis Intelligence Test about the 
middle of the year, and the Test of Children's Feel- 
ings in March. 

The teachers were given the Teacher Education 
Examination at the end of the year (41 took it, 14 did 
not). The ManifoldInterest Schedule was adminis- 
tered to the teachers at the same time. Forty - опе 
teachers took the Brooklyn College Interaction with 
Children Test. 

All teachers were observed (using the Brooklyn 
College Teacher Observation Forms) repeatedly 
during the year, foratotal of six to eighthours each, 
by trained observers.9 These observers hadprevi- 
ously worked inteams with these forms, making ob- 
Servations of other teachers and comparing scores 
until they had achieved a high correlation in their 
Scoring. 


The Manifold Interest Schedule 


The Interest Schedule consists of 420 ‘ activity 
items” divided among thirty categories, of which 
eighteen are non-academic and twelve are academic. 
Six of the categories are represented in each of five 
Subschedules, the items within each subschedule be- 
ing scrambled.6 The subject is asked to respond 
to each item with “‘like,’’ '*indifferent," or ‘‘dis - 
like," — only the likes and dislikes being scored. 
Items include such statements as the following: 


Cleaning up dishes after a meal 

Taking off shoes at the movies 

Seeing a good fight 

Learning about business routines and systems 
Tracing back the origins of certain words 


The raw score is obtained by subtracting dislike 5 
from likes, for each of the thirty categories. These 
scores are Statistically adjusted upward or down- 
ward to compensate for an overall tendency on the 
part of an individual tohave anexcessive proportion 
of dislikes ог likes respectively. А profile is then 
made for each individual, showing ona stanine scale 
his like-dislike adjusted score for each of the eight- 
een non-academic categories, and, separately, for 
each of the twelve academic categories. 

It would appear that there are clustersof traits 
which tend toward linkage with each other on such 
profiles. It has been possible to identify six such 
clusters and to prepare a type profile for each. A 


HEIL - WASHBURNE 349 


way has been devised to measure any individual pro- 
file as to closeness of approximation to eachof these 
six type-profiles known as А, B, C, D, E and F. 

In our experiment the profile of each teacher was 
thus compared and the teachers were thereby classi- 
fied as type A, B, C, D, E, or F. While compari - 
sons, as described below, were made for each of 
the six, the number of teachers, when thus subdivid- 
ed, was too low to be meaningful in certain types . 
It was possible, however, to do some combining. A 
teacher's profile might be closest to Type E, for 
example, but almost as close to Type C. She could 
therefore be combined withthe Type C teachers. In 
this manner we have reduced our data for the teach- 
er types to A, B and C: A ( turbulent, impulsive, 
variable), B (self-controlling, orderly, work ori- 
ented), and C (fearful). 


Preliminary Tentative Results 


l. The most striking and significant result in 
terms of the growth of all four categories of children 
in regard to their academic progress as measured 
by the Stanford Achievement Test is that children 
under teachers of Type B made significantly more 
progress than those under teachers of TypeC, Type 
A falling in between. Type B teachers appear to be 
іп between. Type B teachers appear to be especi- 
ally effective with “орровіпе”” and ‘wavering’ chil- 
dren - probably because of their clarity and struc- 
ture regarding work to be accomplished. 

2. The data reveal that Teacher Type A (turbu- 
lent, impulsive, variable), obtains quite different 
results with different kinds of children. This type 
of teacher obtains little growth by children classi- 
fied as **opposing'' and as *twavering." Нег own 
inherent interest in science and mathematics, how- 
ever, result in superior gains by her pupils in these 
fields. The data suggest varying children's ga ins 
depend basically upon the teacher-child combination 
of personality. 

3. The Type C teacher — fearful, anxious, un- 
Sure of herself -, gets less gain from her pupils 
academically than do Types А or В. 

4. The correlation between the mean class 
achievement of children and the teacher observation 
ratings is negligible. 

5. The correlation between the mean class 
achievement and teacher scores on the Teacher 
Education Examination is negligible. 

6. The correlation between the teacher observa- 
tion ratings and teacher scores on the Teacher Edu- 
cation Examination is significant and of the magni- 
tude 0. 50. 

What does all this mean? It apparently means 
that the clear overriding factor in determining chil - 
dren's academic achievementis adeep-seated teach- 
er personality. It also appears to mean that, while 
the Teacher Education Examination and the teacher 
observation results are consistent between them- 


selves, the kind of information and teacher behavior 
generally considered conducive to children’s learn- 
ing have little or no relation to such learning. In 
this connection, however, it must be remembered 
that all the teachers had a common base in that they 
were college graduates, had taken the usual courses 
of professional preparation, had been approved in 
their practice teaching, and had successfully passed 
the city licensing examination. They had, therefore, 
at least a minimum of professional knowledge and 
approved teaching behavior. Beyond that minimum, 
differences in knowledge and behavior seem to have 
little relationship, if any, to children's achievement , 
the one important factor being the teacher's own 
personality characteristics. Toparaphrase Emer - 
son, “1 cannot hear what you say when what you are 
thunders so loud. ” Ял TES 

7. The Children's Feelings Test was not admin- 
istered until March; thus there is no pre-test post- 
test comparison. Since, however, the children had 
been under the same teacher for seven months when 
tested, it occurred to us that, ifteacher personali- 
ty had a measurable effecton the pattern of the chil- 
dren's feelings, and assuming that the various pat- 
terns of children's feelings were distributed mor e 
or less evenly among the teachers at the beginning 
of the year, we might find that the proportion of each 
kind of pattern varied, after seven months of the 
teachers' influence, according to type of teacher 
personality. 

We are finding, particularly with children on the 
negative and hostile side, that with the B type teach- 
er who is goal setting and work oriented, such chil- 
dren tend to develop more positive feelings, such as 
a greater perception of authorityfigures as accept- 
ing and that such children haveless anxiety. This 
type of teacher, however, had been rated slightly 
lower than the other types on the teacher observation 
Scale. Thus it appears that, while on one hand the 
observers felt such teachers, the goal setting and 
work oriented ones, tended to be stricter and less 
permissive than the A type and eventhan the C type, 
the effect of such teachers on the children was to 
evoke positive feelings and greater achievement. 
The finding suggests a needto reexamine the whole 
concept of permissiveness and security-giving as 
commonly interpreted. 


Social Acceptance Results 


The evidence reveals that the Btypeteacher ob- 
tains significantly more academic achievement than. 
either the A or C type teacher. The question arises 
now regarding the effect of such results in relation 
to the children's feelings toward each other. For 
example, it may be hypothesized that the B type 
teacher may be promoting more competition among 
the children and hence less friendship. The results 
of the Ohio Social Acceptance Scale reveal, howev- 
er, that children with the B type teacher become 
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markedly and significantly more friendly toward 
each other than do children with either the A or C 
type teacher. This may wellbedue to the structur- 
ing and orderly tendency ofthe Btypeteacher, giv- 
ing a greater senseof security to the children. 
While there is a significant negative correlation 
between class size and increase in scores on the 
Ohio Social Acceptance Scale, a correction for this 
does not alter the fact that children gain more under 
the influence of B type teachers than under the other 
types. Thisis true for all four categories of children. 


Characteristics of Teacher with Successful 
Personality Type 


Since the type of teacher personality as indicated 
on the Manifold Interest Schedule seems to be the 
major factor in influencing the progress ofthe chil- 
dren, let us contrast the characteristics ofthe suc- 
cessful Type B teacher with those of the much less 
successful teacher of Type C. Неге we shall dis- 
cuss only the items where the two types differ; in 
a number of other items they coincide. 

In the category of Human Relations, the B teach- 
eris somewhat higher in “Leadership” - interest 
in organizing and directing others either directly or 
behind the scenes. (Power drive.)' She is higher 
in “бате sex" — interest in social, educational and 
athletic activities in which only the same sex is 
usually involved. Andshe is muchhigher in ‘‘Iden- 
tifying with ойһегв”” — interest in ot her people's 
thoughts, feelings, actions and problems in the form 
of intellectual activity, e.g., by reading and by 
direct participation or helping others. 

In the category of Fantasy Life, Type Bis lower 
than Type C in **Magic" — interest in the use of 
Supernatural and magical symbols and processes 
Such as charms, rituals and superstitious beliefs. 
(Practicing magical and wishful control of reality.) 
Туре B is not so low in **Mystery" - interest іп 
Secrets and the origin of things as represented in 
intellectual fields and social taboos. Bis definitely 
lower in *Humor" - interest in slapstick, puns, 
jokes, comics and other similar forms of humor 
(suggesting a form of hostility). She is much high- 
er than Type C in “Dramatics” - interest in the 
theater arts both inthe form of individual perform- 
ance and “‘behind the всепев” activity. Too, type B 
is not quite so lowas C in * Life -Death-Universe"- 
interest in the beginnings and ends of life and in the 
cosmos (reflecting aneed to externalize anxieties 

and put them on a cosmic scale where they can be 
dealt with philosophically). 

In the category of organizationofDrives andIm- 
pulses, the Type B teacher is a little higher than 
Type C in “Ргеоссирайоп with Cleanl iness'— 
concern for cleanliness and purity and a fear of 
contamination. She is much higher in “Method ical "— 
interest in rigidly patterned and repetitive activity. 
(An urge for compulsitivity and ritualization.) But 


by far the greatest contrast between Types B and C 
is іп B's extremely low degree of “ Acceptance of 
Impulses" and ‘‘Aggression.’’ ‘Acceptance of Im- 
pulses" means interest in doing things on the spur 
of the moment and letting oneself go (reflecting a 
tendency to follow hostile and libidinal impulses). 
“Арвгеввіоп”” means interestin doing things which 
may harm or irritate others either by physical or 
verbal means (reflecting an expression of hostile 
and aggressive tendencies ога lack of fear of one’s 
aggressive impulses). 

What is the sum of these characteristics ? It ap- 
pears that the B type teacher is reasonably warm 
and empathic with others, but in contrast tothe Type 
C teacher she is not nearly so fearful about how 
others feel toward her. The most outstanding char- 
acteristic of the B type teacher, however, appears 
to be a leadership role coupled with work orienta- 
tion. The B teacher is clearly self-severe, method- 
ical, and is likely to set standards forherself and 
see that she achieves such standards. Because of 
her leadership interests she also does the same 
thing with her pupils. 

The essential point is that there are identifiable 
types of teachers and that different types have dif- 
ferent effects on the children whom they teach; cer- 
tain ones seemingly aremuch more generally effec- 
tive than others. 


FOOTNOTES 


1. Printed in summary in the May, 1953, issue of the 

Journal of Educational Research, Vol. 46, pp. 
1-715. 

2. We һауе carried on a series of experiments and 
investigations. For several years we sought 
for researchgrants, but failing to get them, we 
nevertheless went on with our studies, within 
our limited research budget which had to be 
used at the same time for much institutional 
and other research work at the college. Finally 
two years ago, the U. S. Office of Educati on 
gave us a grantunder which we have been able 
to proceed at a greatly accelerated pace, fully 
using all the preliminary studies (Grant No. 
SAE 7285). 

3. Consultants for these investigations have been 
Professor Louis Raths of New York Univer- 
sity and Dr. Wayne Wrightstone ofthe Bureau 
of Research of the New York Public Schools. 

4. These children are all in the public schools 04 
Brooklyn. Wehavereasonto believe that the y 
are fairly typical of a cross section, not only of 
New York City, but of any large metropolitan 
center; and we have some indication—sugges- 
tive, not definite, —that the major factors of 
the experiment would not differ importantly in 
quite different settings; obviously, however,the 
experiment should be repeated under diverse 
conditions. 
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5. Professors Celia Baum, Natalie Darcy, Brenda sonality classification of teachers. 


Lansdown and Eunice Matthew of Brooklyn 7. This and the following descriptions are quoted 
College, and Bonnie McHolme of New York directly from A Manual onthe Manifold Inter- 
University. est Schedule, by Louis M. Heil, George A. 

6. In our experiment we used only the non-academic Sheviakov and Solomon Stone; Brooklyn Col- 
categories (three subschedules), forthe per- lege, 1959. 


IT SAYS IN JER continued... 


ers on the Edwards Personal Preference Schedule interms of ‘‘autonomy, d 
‘affiliation, ” “dominance,” and *aggression." In general, potential high- 


School teachers are more **normal."' 
ж k k * 
Do teachers differ from people? Not onthe basis of nage 
i ied to 77 experienced teachers in a University о 
metrio RE $ «Since no dominant charac- 


Oklahoma sample. Concludes Rupiper, | Г 
кеге T мү emerged from the results of standardized tests, it 
was felt that experienced teachers are not essentially different from 


people-in-general. ” 
жж ж ж 


w 


E Field News 


THE NATIONAL ASSOCIATION FOR RESEARCH IN SCIENCE TEACHING in coopera- 
tion with the Office of Education of the U. S. Department of Health, Education 
and Welfare assembles titles and abstracts of all research studies in science 
education. These studies are reported annually at the national meeting of the 
Association and are published biannually in a bulletin available from the U. S. 
Government Printing Office. АП professors of science education, directors 
of instruction or research inschool systems, directors of graduate studies in 
the universities, and others who may know of science education studies com- 
pleted in 1961-62, аге urged to report these to Paul E. Blackwood, U.S. Of- 
fice of Education, Washington 25, D. C. Upon request, special reporting 
blanks will be forwarded so that the author of the research can report the 
study directly. 


ж жж ж 


THE BOSTON UNIVERSITY DIVISION OF CONTINUING EDUCATION 
offeredaspecial seminar program, which beganon March 8, to explore 
employment opportunities for women. Theobjective ofthe seminars was 
to provide a survey of areas offering em ploy ment opportunities for 
women, to identify the specific nature of these opportunities, and to in- 
dicate the nature of the qualifications required. 


X EE 


A STUDY OF THE BEHAVIOR PATTERNS OF YOUNGSTERS and their parents has been 
undertaken in Eau Claire County in Wisconsin. This study is concerned with 
behavior of youngsters and parents in cases where the youngsters have been 
nominated by their teachers as consistently exhibiting approved or disapproved 
behavior. А totalof 384children, distributed evenly by thethree grade levels— 
third, sixth, and ninth—by urban or ruralresidency, and by approved or dis- 
approved behavior, will serve as subjects along with their mothers and fathers. 
Data will be gathered by a staff of professional socialworkers who will inter- 
view the father, mother, and child; rate the family on a number of variables 
including the Glueck Factors; administer the KD Proneness Scale , Situation 
exercises especially developed for this study and a special sentence comple- 
tionform. The hypotheses are concerned with urban-rural, approved-disap- 
proved, male-female, and grade level differences. 


ж жж ж 


THE FIFTH ANNUAL WORKSHOP ІМ READING RESEARCH was held 
in Chicago on April 17. Тһе purpose of the meeting was to encourage 
diagnosis and evaluation of learning in the field of reading and the use 
of those findings in reading research. The workshop consisted of a 
discussion by the audience andauthors of submitted papers selected for 
presentation, and a discussionof needed research inthe field of reading. 
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Teachers’ Estimates of Adolescents’ Responses to F-Scale Items 


The Problem 


CAN TEACHERS effectively estimate adoles- | 
cents’ feelings on any given issue? Many persons 
maintain that it is important for teachers to ** see 
things from the students’ point of view. ЖМ ш; шс 
paper reports a study of teachers' estimates of ado- 
lescents’ responses to various items of the F-scale, 
a measure of anti-democratic potential (1). As such, 
it represents an effort to determine how accurately 
teachers can predict high school students’ feelings 
on a variety of particular issues. 

Specifically, the following questions were ex - 
plored: 

1. How will adolescents respond to the various 

items of the F-scale? 

2. Will there be any significance to the way in 
which they agree ordisagree with particular 
items? 

3. Can teachers estimate how adolescents will 
respond to various items of the F-scale? 

4. Will there be any significance to the accuracy 
with which teachers estimate adolescents’ 
responses ? 

Scodel and Mussen (5) reported that low F-scale 
scorers were more accurate in estimating high F- 
scale scorers’ responses tothe measure than were 
high scorers in estimating the low scorers’ respon - 
Ses. Cohn (2) reported that a group of college stu- 
dents successfully faked responses to the F-scale, 
implying that they could estimate, with some degree 
of accuracy, the responses of certain kinds of per- 
sons. Frymier (3) reported that prospective teach - 
ers correctly estimated adolescents’ responses to 
certain items of the F-scale, but incorrectly esti - 
mated their responses to certain other items. Jen- 
kins and Lippitt (4) also reported differences in the 
interpersonal perceptions of teachers and students. 
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The Procedure 


In an effort to answer the questions described 
above, the F-scale was administered in the usual 
manner to536 high school students and 83 high 
school teachers in southeastern Pennsylvania. In 
addition, the teachers responded to the F-scale as 
they thought most high school students would re- 
spond. All responses were made anonymously. 

The sample of high school students included 280 
boys and 256 girls with a mean age of 14. 95 years. 
They represented 19 different Schools inthe **Penn- 
sylvania Dutch’’ country between Philadelphia and 
Harrisburg. 

The sample of high school teachers included 60 
men and 23 women with a mean age of 30.16 years. 
Further, this group had 3.63 mean years of experi- 
ence teaching and an average of 17.17 years of edu - 
cation. Most of the teachers were native Pennsyl- 
vanians, and all were teaching in various high 
schools in the Philadelphia and Harrisburg areas. 

The extent to which adolescents agreed or disa- 
greed with each item was computed by adding alge- 
braically all of the students' scores for each indi- 
vidual item. The various items were then listed in 
rank order according to the extent to which these 
young people agreed with them. Those items with 
which they agreed most were listed first, and those 
items with which they disagreed most were listed 
last. This listing was inspected for significance. 

Following this, the way in which high school 
teachers thought adolescents would respond to the 
various items of the F-scale was determined by 
adding algebraically the scores accorded individual 
items by the teachers in their estimates of high 
school students" responses. The teachers'esti- 
mates of adolescents' responses were then ranked 
in order of their agreement and compared to the ac- 
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tual way in which the high school students did re- 
spond. In comparing rank orders, those estimates 
by teachers whichdifferedless thanfive places were 
considered as correct estimates of adolescent ге- 
Sponse. Those which differed more than nine posi- 
tions were considered incorrectly estimated. That 
is, if the relative ranking according to agreement of 
the high school groupand the relative ranking of the 
teachers’ estimate of the high school group were 


less than five positions apart, the items were con- 


Sidered as having been correctly estimated; if they 
were more than nine positions apart, they were con- 
Sidered as having been incorrectly estimated. 


The Results 


The results of these procedures are recorded in 
Table I. As mentioned before, the F-scale items 
аге listed according to the way the adolescents 
agreed with the various items. That is, that item 
which is listed first represents the one with which 
these particular high school students agreed most. 
That item which is listed last is the one with which 
these high school students disagreed most. 

In effect, then, Table I represents a sortof value 
System as indicated by the way in which these ado- 
lescents agreed or disagreed with the various ideas 
involved. Likewise, the way in which the teachers 
thought adolescents would respond also represents 
à sort of value system for the particular teachers 
Studied. In other words, their estimates of young 
peoples' responses gives some indication of these 
teachers' perceptions of high school students and 
what they believe about these youngsters. Indirect- 
ly, atleast, this might be interpreted as a reflec- 
tion of these teachers' own value System. In addi - 
tion, of course, it also indicates those areas in 
which these teachers were accurate or inaccurate in 
their estimates of students' responses. 

For example, this particulargroup of high school 
teachers estimated rather accurately adoles c e nt 
agreement with items relative tothe following ideas: 
prying into personal matters, thelimitations of sci- 
ence, inevitability of war, not hurting relatives, and 
need for faith in some Supernatural power. Simi- 
larly, they estimated fairly accurately the extent to 
which adolescents tended to disagree with these 
items: lives being controlled by secret plots, need 
to protect one's self from disease because of differ- 
ent kinds of people, the idea that familiarity breeds 
contempt, the desire of some persons to jump from 
high places, and the idea that people must suffer to 

learn. 

On the other hand, these high school teachers in- 
correctly thought that young people would disagree 
with these items: young people getting rebellious 
ideas, the need for childrento obey and respect au- 
thority, respecting parents, youth's need for strict 
discipline, and comparison of sex life of ancient 
Greeks to goings-on today. Also, they incorrectly 


thought that young people would tend to agree more 
than they actually did with the following items: pun- 
ishing sex crimes, need to keep busy when one has 
a problem, punishing insults, importance of busi- 
ness man over the artist and professor, and divid- 
ing people into weak and strong classes. 


Discussion of Results 


Now, what do these data mean? First, inspect- 
ing the adolescents’ responses to the various F-scale 
items according to a descending order of agreement 
seems to indicate a hierarchy of values. For exam- 
ple, if one examines both ends of the rank order in 
Table I he gets the impression that these young peo- 
ple want and like mature assistance and direction, 
but that they also have confidence in themselves and 
their own potential. Further, they seem to feel that 
people are important; that everybody counts. This 
implies a kind of positive value system. 

On the other hand, it seems that this particular 
group of high school teachers accord young people, 
in part at least, a negative value system. They es- 
timated fairly accurately adolescent sentiment on 
about a third of the items involved, butthey also es- 
timated rather incorrectly adolescent sentiment on 
at least ten other issues. Of special significance is 
the fact that many of these latter items were direct- 
ly related to young peoples’ thought and values. For 
example, four items with which adolescents tended 
to agree - youth getting rebellious ideas, obeying 
and respecting authority, respect forparents, and 
youth’s need for strict discipline -, the high school 
teachers, almost without exception, felt adolescents 
would disagree with. Also, the fact that these young 
people did not feel that insults should be puni shed, 
certain people are more important than others, or 
that there are two classes of people, whereas this 
particular group of high school teachers felt adoles- 
cents would agree with these items, seems signifi- 
cant. 

Moreover, when these data are compared to col- 
lege students’ estimates of adolescent response to 
F-scale items (3), it seems that experienced teach- 
ers actually do no better than persons who have not 
taught. 

Do adolescents, however, really feel that **obedi- 
ence and respect for authority are the most impor- 
tant virtues children should learn?” That “ young 
people get rebellious ideas, but as they grow up they 
Ought to get over them?” That ‹‹ по weakness Or 
difficulty can hold us back if we have enough will 
power?” Do they really feel that people cannot be 
divided into two classes? That the business man is 
not more important than the artist or profes sor? 
That familiarity does not breed contempt? Th e 86 
problems deserve further study. 

Also, do teachers feel that high school students 
are really “‘authority rebels, ’’that they dislike con- 
trol, and that they do not want to show respectíor 


FRYMIER 355 


TABLEI 


ADOLESCENTS' RANKING ACCORDING TO AGREEMENT OF F-SCALE ITEMS 


ЕЕРЕЕ 


Adolescents* Ranking Teachers! Estimates of 
According to Agreement Item Adolescents’ Rank i ng 
оО ee ыу ыу ee А 
1 No weakness or difficulty can hold us back if we ТІ 
have enough will power. 
2 Young people sometimes get rebellious ideas, but 15 
as they grow up they ought to get over them and 
settle down. 
3 Obedience and respect for authority are the most 25 
important virtues children Should learn. 
4 Nowadays more and more people are prying into 2 
matters that should remain personal and private. 
5 Science has its place, but there are many important 6 
things that can never possibly be understood by the 
human mind. 
6 There is hardly anything lower than a person who 16 


does not feel a great love, gratitude and respect 
for his parents. 


7 If people would talk less and work more, everybody 14 
would be better off. 


8 A person who has bad manners, habits and breeding 1 
can hardly expect to get along with decent people. 


9 What the youth needs most is strict discipline, rugged 26 
determination, and the will to work and fight for 


family and country. 


10 Human nature being what it is, there will always be 12 
war and conflict. 


11 No sane, normal, decent person could ever think of 10 
hurting a close relative or friend. 


12 The wild sex life of the ancient Greeks and Romans 22 
was tame compared to some of the goings-on in this 
country, even in places where people might least 


suspect it. 
13 What this country needs most, more than laws and 5 


political programs, is a few tireless, courageous, 
devoted leaders in whom the people can put their faith. 
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TABLE I (continued) 


сна а ee ES MM eee 


Adolescents’ Ranking Teachers’ Estimates of 

According to Agreement Item Adolescents’ Ran king 

PEPENE PRES UL O EDO e ACD Kela ÓÀÀ— ÀÀ—ÓÀ 
14 Sex crimes such as rape and attacks on children 3 


deserve more than mere imprisonment; such 
criminals ought to be publicly whipped or worse. 


15 Most of our social problems would be solved if 8 
we could somehow get rid of the immoral, crooked 
and feeble-minded people. 


16 Someday it will probably be shown that astrology 23 
can explain a lot of things. 


17 Every person should have complete faith in some 17 
supernatural power whose decisions he obeys 
without question. 


18 Most people don’t realize how much our lives are 18 
controlled by plots being hatched in secret places. 


19 Nowadays when so many different kinds of people 20 
move around and mix together so much, a person 
has to protect himself especially carefully against 
catching an infection or disease from them. 


20 When a person has a problem or a worry, itis 9 
best for him not to think about it, but to keep busy 
with more cheerful things. 


21 Familiarity breeds contempt. 24 

22 An insult to our honor should always be punished. 4 

23 Some people are born with an urge to jump from 21 
high places. 

24 Wars and social troubles may someday be ended 19 


by an earthquake or flood that will destroy the 
whole world. 


25 The business man and the manufacturer are much 13 
more important to society than the artist and the 
professor. 

26 Nobody ever learned anything really important 27 


except through suffering. 


27 People сап be divided into two distinct classes: 11 
the weak and the strong. 


T———————————À——É—— —oá—— O uc 
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authorities of any kind, as the data in this particu- 
lar study seem to imply? Andif high school teach- 
ers feel this way, why do they? And if they do, why 
are they teaching? These problems, too, warrant 
further investigation. 


Summary 


Five hundred thirty-six high school students and 
83 high school teachers were measured with the F- 
scale. The way high school students agreed an d 
disagreed with the various items of the measure was 
compared to the way high school teachers thought 
high school students would agree or disagree. The 
teachers correctly estimated adolescent response to 
ten items, but incorrectly estimated response to 
ten other items. Certain problems were posed for 
further study. 
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Developments in the Teaching of Secondary 


School Mathematics 


I. Introduction 


In recent years the mathematics curriculum in 
the secondary school has been receiving some con- 
siderable consideration as to its adequacy and its 
need for articulation with the college curriculum. 
This being the situation, it is most important that 
the secondary school mathematics teacher be aware 
of recentdevelopments іп (һе teaching of mathemat- 
ics and, as а corollary, develop his background in 
some of the more modern mathematical concepts. 

Although a large amount of material has been pub- 
lished which can be of assistance to the secondary 
school mathematics teacher, it is widely scattered 
throughout the literature. It would seem that a sum- 
mary of some of the more important work being un- 
dertaken, with brief comments, would be of value to 
many teachers. 

The discussion presented here will consider the 
following areas: The work of the Commissionon 
Mathematics of the College Entrance Examination 
Board, Contributions of the National Council of 
Teachers of Mathematics. The education of Teachers 
of Secondary School Mathematics, The support given 
to the education of secondary school mathematics 
teachers by the National Science Foundation, Studies 
being carried on by the University of Illinois Com- 
mittee on School Mathematics, The efforts of the 
School Mathematics Study Group, and The work of 
the Committee onthe Undergraduate Program of the 
Mathematical Association of America. 


П. The Workofthe Commission on Mathematics of 
e College Entrance Examination Boar 


The Commission on Mathematics of the College 
Entrance Examination Board was an outgrowth of the 
concern of the Mathematics Examiners for the val- 
idity of their tests. With the introduction of many 
topics in modern mathematics into the curriculums 
of some forward looking high schools and the need 
for this type of mathematics as a prerequisite for 
the study of mathematics atthe college level, it was 
feltthatanexamination of the secondary school math- 
ematicscurriculum was in order. Thus, in 1955 the 


` Several progress reports which were widely distrik 
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Commission on Mathematics was appointed. Its wor. 
was supported jointly by the College Entrance Exan 
ination Board and the Carnegie Corporation of 
York. The objectives of the Commission were 
consider broadly the secondary school college pri 
aratory curriculum and to make recommendati 
looking towardsits modernization, modification, 
improvement. ” These objectives have been presen 
ed in detail in a pamphlet entitled Objectives of 
Commission on Mathematics of the College Entrance 
Examination Board (11:1-5). Л 


The Commission was fortunate to have Dean А 
bert E. Meder, Jr. of Rutgers University as execu- 
tive director during most of its life and Dr. All 
W. Tuckerof Princeton University as chairma 
Among the many distinquished educators serving On 
the Commission were C. B. Allendoerfer of the Uni- 
versity of Washington, H. F. Fehr of Columbia Uni- 
versity, and S. S. Wilks of Princeton University. 

The progress of the Commission was reported i 


uted to people having an interest in their work. 
addition, the writing groups of the Commission pro 
duced a number of pamphlets as the work progresse 
Inasmuch as these pamphlets supplement the f і 
reportandare valuable reading material for ће sec- 
ondary school mathematics teac her, several of t! 
are listed here: 


L oe the Mathematics Curriculum 
2. “Modern Mathematics and Its Place in t 
Secondary Schools’? (9). 

3. “The Education of Secondary School Mathem: 
ics Teachers” (6). 

- “The Introduction to Algebra from the PO 
of View of Mathematical Structure" (7). 

- “Sets, Relations and Functions'* (13). 

+ “The Mathematics of the Seventh and E 
Grades” (8). 


oc A 


The work of the Commission on mathematics 2 
culminated early іп 1959, at which time the final 
port (12) was presented. The report included a d 


cussion of the urgent need for curriculum revision 
insecondary school mathematics, the premises upon 
which recommendations were made, a recommended 
program for grades 9 through 12, the education of 
mathematics teachers, andarticulation of secondary 
with college mathematics programs. 

Published in a separate volume were Appendices 
to ће Report of the Commission on Mathematics. 
The introduction sets forth its purposes as follows: 


“These appendices are not classroom units, nor 
are they offered as a manual on methodology. One 
oftheir purposes is to provide information, instruc- 
tion, and enrichment that will assist teachers in 
preparing themselves to translate the recommenda- 
tions of the Report into action, as classroom mat- 
erials become available. . . Another purpose of the 
appendices is to amplify and clarify certain of the 
Commission's recommendations.” (5: vii). The areas 
of mathematics covered by the Appendices are alge- 
bra, geometry, and trigonometry. In addition, re- 
ferences are given to important materials that can 
be obtained easily. 

Other publications of the Commission on Math- 
ematics include: 


"Introductory Probability and Statistical In- 
Тегепсе” (with teachers notes and answer guide 
іп а separate volume). 

“Concepts of Equations and Inequality. "' 
“Informal Deduction in Algebra; Properties 
of Odd and Even Numbers. ”” 


Each of these publications can be purchased at a 
nominal cost from the College Entrance Examination 
Board, c/o Educational Testing Service, Box 992, 
Princeton, New Jersey. It would be well for every 
teacher of secondary school mathematics to have 
each of these in his library, or at least have them 
available to him through his school library. 

The study conducted by the Commission is among 
the most significant work that has been done during 
thetwentieth century with regard to secondary school 
mathematics. 


Ш. Contributions of the National Council of Teachers 
of Mathematics 


The organization which was probably made the 
greatest overall contribution to the advancement of 
Secondary school mathematics is the National Coun- 
cil of Teachers of Mathematics (NC TM). This is а 


department of the National Education Association. 
Its journal, The Mathematics Teacher, contains 
many suggestions, whichare useful іп the classroom 
and help the practicing teacher remain current on 
happeningsin the worldof mathematics. The Coun- 
cil also produces other useful and interesting pub- 
lications As examples of these publications, sev- 
eral will be discussed briefly. : 

Two booklets are available which deal. with gifted 
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students. These are Mathematics for the Academi- 
cally Talented Student (18), a report of a joint con- 
Terence with the NEA, and Program Provisions for 
the Mathematically Gifted Student in the Secondary 
School, by E. P. Vance and others (29). The first 
Suggests guides for providing a mathematics pro- 
gram for academically talented students, while the 
other presents programs which have been developed 
in several types of schools. 

The National Defense Education Act of 1958 (Pub- 
lic Law 85-864) provides for the purchase of equip- 
ment, including teaching aids for mathematics and 
other subjects. Because of the importance of this, 
the NCTM has had prepared a booklet, A Guideto 
the Use and Procurement of Teaching Aids for Math- 
ematics, by Emil J.BergerandDonovan А. Johnson 
(3). T booklet may be purchased from the NCTM 
for 75 cents a copy. The booklet not only includes 
alisting of available training aids, the source of 
procurement and price, but also discusses or lists 
such things as the use of training aids, Public Law 
85-864, producers of mathematical films and film 
strips, school library lists and the organization of 
School libraries. 


Perhaps the premium productof the NC TM is its 
series of yearbooks. The Twenty-third Yearbook 
will be discussed briefly to provide the reader with 
some understanding of the scope of these products 
(24). It isentitled Insights into Modern Mathematics 
and furnishes background and reference materi 
for modern mathematics. Each chapter deals with 
a special phase and is written by an expert inthe 
field. For example: Chapter II by Ivan Niven deals 
with “Тһе Concept of Number, ’’ Chapter III by E. J. 
McShane covers ‘‘Operating with Sets’, Chapter VI 
by Walter Prenowitz discusses ‘‘Geometric Vector 
Analysis and the Concept of Vector Spaces’’, Chap- 
ter X by R.H. Bingis on «point Set Topology’’, and 
Chapter XI by Herbert Robbins presents “Тһе Theory 
of Probability". The editorial committee consisted 
of F. Lynwood Wren (Chairman), Carl B. Allendoer- 
fer, Saunders MacLane, Bruce E. Meserve, and 
Carroll V. Newsom. 


Although all of the yearbooks are of value to sec- 
ondary school mathematics teachers, several, in 
addition to the Twenty-third Yearbook, are worthy 
of special mention. The Fifteenth Yearbook, entitled 
The Place of Mathematics in Secondary Education, 
is of special interest (26). Unfortunately, this book 
is out of print and can no longer be purchased; how- 
ever, it is available in many libraries. The Twenty- 
first Yearbook, entitled Learning of Mathematics, 
Its Theory and Practice, is an exposition on the ap- 
plication of learning processes to the classroom(25). 
The Twenty-fourth Yearbook (23), The Growth of 
Mathematical Ideas, Grades K-12‘‘sugg ests how 
basic mathematical ideas, modern or traditional, 
can be made continuing themes in the de velopment 
of mathematical understandings”. All yearbooks 
still in print can be purchased from the NCTM. 
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IV. The Education of Teachers of Secondary School 
Mathematies 


А keen and widespread interest in the education 
of secondary school mathematics teachers exists 
today. Special attention is being given to it by teach- 
er accrediting agencies. An organization which is 
actively looking into the preparation of teachers is 
the Committee on Teacher Education and Profes- 
Sional Standards (TEPS). 

The Commission on Mathematics of the College 
Entrance Examination Board has devoted a consid- 
erablepartof its final report (12:48-58), which was 
referred to above, to the education of secondary 
School mathematics teachers, as well as publishing 
а separate pamphlet on the subject (6). It is pointed 
out that the success of any curriculum recommenda- 
tions depends on a well trained teachingforce 
throughout the country totranslate them into reality 
By well trained, they mean trained to ‘‘teach the 
subject matter in the spirit oftwentieth-century 
mathematics". 

In order to provide the secondary school mathe- 
matics teacher with instruction which will be help- 
ful in enriching his background, the Commission on 
Mathematics has presented in the appendices to its 
report (5) some study materials for teachers. Fur- 
ther, the Commission on Mathematics, with the 
School Mathematics Study Group, is planning the 
preparation of study guides, which can be used by 
individuals or groups for self-improvement. 

Although there exists only alimited number of 
mathematics books designed to meet the needs of 
Secondary school teachers, the number is expected 
to increase. In addition, various professional pub- 
lications, such as The American Mathe matical 
Monthly, The Mathematics Teac er, and 5сһоо 
Science and Mathematics, canbeof valuable assist- 
ance to the teacher. 

Many secondary school mathematics teachers 
have been at a loss to know what constitutes an ad- 
equate preparation for their job. Report of the Com- 
mission on Mathematics (12:54-58) presents alist 
of suggested courses which is considered to be ad- 
equate preparation for those undertaking an under- 
graduate program preparatory to teaching secondary 
School mathematics. Some further comments on the 
Subject of what constitutes adequate preparation to 
teachhighschool mathematics were made as part of 
а panel discussion at the annual meeting of the New 
Jersey Section of the Mathematical Association of 
America on November 1, 1958. These comments 

were made by Meder, Lorch, Tucker, Meser ve, 
Walter and Fender. They are summarized in The 
American Mathematical Monthly (21:805-11) 
(22:909-14). 

Through the efforts of various foundations, es- 
pecially the National Science Foundation (NSF), 
Special courses are being designedfor the in-service 
teacher who returns to the campus for a summer 

yetudy program or even a full academic year. The 
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NSF has for several years sponsored institute pro- 
grams for secondary school teachers of mathematics 
and science. There are three institute programs 
which have as their prime objective “(һе improve- 
ment of the quality of science and mathematics in- 
struction in the Nation’s classrooms’ (28). Thethree 
institutes are Summer Institutes, Academic Year 
Institutes, and In-service Institutes. 

The Summer Institutes during 1962 will provide 
financial aid for about 20,000 teachers to engage in 
summer study. The Academic Year Institutes will 
provide stipends for about 1, 500 teachers for a full 
academic year of study. While theIn-service Insti- 
tutes provide only travel allowances for non- school 
hourtraining, the NSF does encourage and give guid- 
ance to educational institutions willing to provide 
Such institutes. Detailed information concerning 
these programs may be obtained by writing to the 
National Science Foundation, Washington 25, D.C. 


V. Work Being Carried on by the University of Шіп- 
ois Committee on ool Mathematics 


Some of the most valuable work in the develop- 
ment of instructional units for secondary school 
mathematics (grades 9 thru 12) is being done by the 
University of Illinois Committee on School Mathe- 
matics (UICSM). The director of this project is Pro- 
fessor Max Beberman of the University of Illinois. 
The work of the Committee began in 1951. Until 
1956 it was supported by the University of Illinois. 
Since 1956 the Carnegie Corporation has been con- 
tributing heavily to the support of the project. 
A number of units for high school mathematics have 
been completed and are available for purchase. In- 
formation can be obtained by writing to the University 
of Illinois Committee on School Mathematics, 1208 
West Springfield, Urbana, Illinois. 

The testing and development of these curriculums 
was begun in 1951 by using just two pilot schools in 
Illinois. During the year, 1959-1960, over 80 schools 
throughout the country were trying out the UICSM pro- 
gram. 

Recently, rather wide publicity has been given to 
the work of the UICSM. Professor Beberman dis- 
cussed it at some length in a lecture given at Har- 
vard, which has been published by Harvard Univer- 
Sity Press (1). In this lecture he points out the two 
guiding principals which are followed in the prepar- 
ation of their units. The first is to use precision in 
language and the second is to leadthe student into 
independent discovery of mathematical truths. 

Two articles, whichhave recently been published 
inprofessional magazines, have given wider publicity 
to this worthwhile work. One is an article by M- 
Eleanor McCoy, a UICSM Teacher Coordinator (19): 
The second is by Professor Beberman (2). 


VI. The Efforts of the School Mathematics Study Group. 
VL. The Efforts ofthe School Mathematics Study Group. 


The School Mathematics Study Group (SMSG) 18 


another of the worthwhile efforts to improve t he 
teaching of mathematics for which the National Sci- 
ence Foundation is providing support. The group was 
organized in 1958 and consists of ‘‘college and uni- 
versity mathematicians, high school teachers of 
mathematics, experts in education, and represent- 
atives of science and technology”. Its objective is 
to improve the quality of mathematics teaching and 
to emphasize the teaching of basic concepts rather 
than only mathematical skills. This isto be achieved 
by improving the curriculum, attracting capable 
students to the study of mathematics, and providing 
help for persons preparing to teach mathematics. 

The SMSG hopes to achieve its objectives 
through the establishment of specific projects to be 
supervised by a panel composed of members of the 
group. The overall work will be supervised by an 
Advisory Committee. One of the earliest products 
of the Group was a ‘‘Study Guide in Modern Algebra" 
(27). The guide is for use of teachers and offers 
suggestions on howto read mathematics, what basic 
ideas to seek out, and wheretofindthese basic ideas 
discussed. 

In addition to study guides, the SMSG is prepar- 
ing units in mathematics for classroom use, each 
with a commentary for the teacher. Junior High 
School Mathematics Units (grades 7 and 8) arenow 
available for purchase. Information about study 
guides, units, and other products of the SMSG can 
be obtained by writing to: 


School Mathematics Study Group 
Box 2029, Yale Station 
New Haven, Connecticut 


The SMSG now has four projects under way. These 
аге an experimental project to produce mathematics 
units for grades 7 and 8, as mentioned above; the 
production of sample textbooks which, it is hoped, 
Will provide models for authors of text books; the 
production of a series of monographs which will sup- 
plement regular text books in the mathematics class; 
and the production of teacher training materials. 

Further detail on the topics mentioned above as 
well as information on the progress of the SMSG is 
contained ina series of Newsletters. Any interested 
person can be placed on the mailing list for these by 
writing to the SMSG. 


VII. The Committee on the Undergraduate Program 
of the Mathematical Association of America 
Although the Committee on the Undergraduate 
Program ofthe Mathematical Association of America 
(CUPM) is concerned primarily withthe undergradu - 
ate program at the collegelevel,there are atleast two 
reasons for including a discussion on its work in a 
Paper on the teaching of secondary school mathema- 
tics. First, it is contributing a large part of its ef- 


fort to the investigation of what should constitute ad- 
equate training for teachers of secondary school 
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mathematics. Secondly, itis concerned with the ar- 
ticulation of secondary and college mathematics 
courses. 

The CUPM wasfirst appointed in 1953 and derived 
financial support during the past years of its exist- 
ence fromthe Ford Foundation and the Mathematical 
Association of America. The major product of the 
CUPMtodateis the publication of two texts called 
Universal Mathematics, Parts I and II (15)(14). Al- 

oug] ese volumes might actually be used as texts 
inteaching college freshman mathematics, they were 
intended primarily as guides or models to be used 
by textbook writers. 

In1959the CUPM wasdissolvedforthe purpose of 
expanding the committee and modifying its methods 
of procedure. Certain proposals onthis were made to 
the Governors at the January 1960 meeting of the 
MAA in Chicago. Atthe same meeting the CUPM an- 
nounced that future effort would be exerted in the 
following areas: teacher training, mathematics for 
physics and engineering sciences, mathematics fo r 
Social and biological sciences, and mathematics for 
mathematics majors. A summary of the work of the 
СОРМ iscontained ina recent issue of The American 
Mathematical Monthly (16:213-20). 


УШ. Miscellaneous Items 


Althoughonly a brief discussion has been given to 
each of the above topics, it is hoped that some ap- 
preciation of the research being undertaken in the 
teaching of mathematics hasbeengained. Those pre- 
sented are the major contributions. However, there 
are many smaller efforts being undertaken, which 
have not been discussed. Notable among the omis- 
sions is the work being done on the advanced place- 
ment program, which is being carriedon by the C ol- 
lege Entrance Examination Board in conjunction with 
many secondary schools and institutions of high er 
learning. 

A good introduction to many of the topics discus- 
sed in this paper can be obtained by reading thelast 
mentioned article in Section VII (16:213-20). Con- 
tained therein are brief and up-to-date comments on 
several of the topics discussed in this paper. 

A study of «Problems in Mathematical Education'* 
has been accomplished under the directionof the Ed- 
ucational Testing Service. Funds for the study were 
made available through a grant by the Carnegie Corp- 
oration of New York. The purpose of the study was 
“to describe the kinds of research activities which 
would leadtobetter mathematics courses and teach- 
ing in the elementary and secondary schools" (17:1). 
The results of the study were reported in a booklet 
by Dyer, Kalin, and Lord (17). An extensive biblio- 
graphy is included in the booklet, which is a useful 
source of current literature on the subject. 

One other item of general interest to teachers of 
secondary school mathematics is the availability of 
sound motion pictures, which attempt to clarify 
some of the modernconcepts of mathematics. These 
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can usually be bought or rented at a modest cost. 
The McGraw Hill Book Company, Inc. is an example 
of a producer of such films. Details on their films 
can be obtained by writing to Text-Film Department, 
McGraw Hill Book Company, Inc. 330 West 42nd 
Street, New York. Thereare, of course, other pro- 
ducers of such films. 
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A Note On the Relationships Between MTAI and GZTS Scores 


For Three Levels of Teacher Experience 


Introduction 


IN A RECENT study, Leeds (6) reported re- 
lationships between the Minnesota Teacher Attitude 
Inventory (MTAI) and the various trait scores of the 
Guilford-Zimmerman Temperament Scale (GZTS) 
for а group of 300 in-service teachers (270 women 
and 30 men) drawn from the first to twelfth grades 
ina large South Carolina city. All of the correla- 
tions between MTAI and the GZTS traits were sig- 
nificantly related at the 1 per cent level except for 
G (General Activity), R (Restraint), and T (Thought- 
fulness) scales. These correlations are presented 
in Table II, column 4. Leeds concluded onthe ba- 
sis of his results that performance on the MTAI is 
an indirect measure of personality as this variable 
is measured by the temperament traits of the GZTS. 

Ferguson, ina factorial study of the MTAI 
(2), also correlated MTAI scores and the various 
GZTS trait scores fora group of 117 student teach- 
ers (91 women and 26 men) at the University of 
Missouri teaching at both the elementary and sec- 
Ondary levels. Тһе correlations аге reported in 
column 3 of Table II. Although significant relation- 
Ships were not reported inthe original study, appli- 
cation of significance tests from Garrett (3)show 
that only the P (Personal Relations) scale is signi- 
ficantly related at the 1 per cent level. 

The present investigator after reading the 
above studies observed that the correlations between 
MTAI and the various GZTS traits were generally 
in the same direction but were ofasmall magnitude 
for the student teacher group. This canbe observed 
by the reader through inspection of columns 3 and 
4 of Table II. 

Since the two groups used by Leeds and 
Ferguson represent different levels of teacher ex- 
perience, it occurred to the writer that a third lev- 
el of experience (actually inexperience) might also 
ШО у 2/72 n 
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be investigated to see if the MTAI-GZTS trait cor- 
relations would be in the same general directionand 
of the same relative size as in the two studies cited. 
The addition of the prospective teachers would per- 
mit a comparative study of the relationships between 
the variables at the three levels of teaching experi- 
ence. 

The general purpose of the study, therefore, 
was to compare MTAI-GZTS trait correlations for 
a group of inexperienced or prospective teachers to 
those reported in the previous studies using student 
and in-service teachers. 


Procedure 


The data used in the study were obtained 
from three sources. The correlations for the ex- 
perienced or in-service teachers were o btained 
from the study reported by Leeds. The data for the 
student teachers were obtained from the microfilm 
of Ferguson’s dissertation. The MTAI-GZTS trait 
correlations for beginning or inexperienced teachers 
was obtained by the writer from a group of 196 stu- 
dents (112 women and 84 men) enrolled in Edu- 
cational Psychology at Purdue University during the 
spring semester of 1956-57. This course serves at 
the present time as the introductory course in the 
secondary level professional education sequence. 
The two tests are administered during the first 
week of the course as part of the over-all program 
of student selection and guidance in the Division of 
Education. Product-moment correlations were ob- 
tained between the M TAI raw score and each of the 
GZTS trait raw scores. Sex was disregardedwith 
the exception of the M (Masculinity) scale where 
male and female students were considered as sepa- 
rate groups. This was done to provide comparative 
data since Leeds had obtained similar correlations. 
Ferguson did not report separate correlations for 


* A paper presented to the American Educational Research Association Annual Meeting, St. Louis, Missouri, 


February 23, 1958. 
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females and males on the M-scale. 


Results 


Descriptive data in the form of the means and 
standard deviations for the variables used are ге- 
ported in Table I. Inspection of this table shows 
Some variation in mean MTAI score for the three 
groups. Thestudentteachers havethe highest mean, 
followed by the experienced teachers and beginning 
teachers inthat order. The variability of MTAI 
Scores is largest in the experienced teacher group 
andsmallest in the student teacher group. Generally, 
the means and standard deviations for each of the 
GZ TS traits for the three groups are quite similar. 
The difference in mean values for the M-scale be - 
tween the student teachers andthe other two groups 
càn probably best be explained by the fact that the 
men and women were combined together in the 
former. 

The results relative to the correlation coeffi- 
cients for the three groups are presented in Table 
Il. This table shows the correlations between the 
МТАІ and each of the GZTS traits for the three 
groups involved in the study. 

The correlations for beginning teachers are 
quite similar in direction and magnitude to those ob- 
tained for student teachers but both of these groups 
have correlations generally smaller than those for 
experienced teachers. Exceptions are to be noted 

inthe G, R, and T scales where the correlations 
are quite similar. Such similarity may be due 
to chance factors as wellas to realbutslight dif- 
ferences between the groups. The only trait show- 
ing significance for all three groups was the P-scale. 

The question may be asked as to why the G, 
R, and T scales correlate so low with the MTAI for 
each of the three groups. The answer may possibly 
lie in the fact that these personality traits are not 
measured by the MTAI. In other words, these fac- 
tors contribute only asmallamountto the total var- 
iation of the MTAI score; the largest portion of the 
ee being accounted for by the other seven 
scales. 


Discussion of Results 


Inspection of the MTAI means for the three 
groups as presented in Table I shows a slight cur- 
vilinear relationship. That is, the student teachers 
have a higher mean than does the beginning teacher 
group which in turn has a slightly higher mean than 
the experienced teacher group. This pattern of 
means follows the general patternof means for sim- 
ilar groups as reported in ће MTAI manual (1), re- 
Sults reported in previous studies, and the writer’s 
observations in using the scale with undergraduate 
and graduate students. It probably best reflects the 
acceptance by student teachers of the desirable 
teacher-pupil attitudes developed in education 

courses which are later modified by actual teaching 
experience to less desirable teacher-pupil attitudes. 


The similarity in correlations obtained for 
the beginning and student teacher groups are from 
rational considerations not unexpected. There is 
probably not too much difference between beginning 
and student teacher groups in terms of the person- 
ality variables, particularly in increased variations 
within the traits with additional training. As for the 
P-scale being significant in all groups, this too is 
not unexpected since the Guilford-Zi mmerman 
manual (4) points out that this temperament dimen- 
Sion reflects the ability to get along withothers, an 
area of personality certainly important in teaching. 
The MTAI is also somewhat designed to measure 
this same dimension. Differences between the two 
inexperienced groups in the correlations for the 
other personality traits might be explained by var- 
iability within the groups, real but slight person- 
ality differences, and chance. 

The observed differences in MTAI-GZTS trait 
correlations from the inexperienced and student 
teacher groups when compared to the experienced 
group seem, on the surface, mostsimply explained 
by the presence of greater heterogeneity of MTAI 
Scores among experienced teachers. Increased 
variation of MTAI scores with experience is a con- 
dition also noted in the MTAI manual and has been 
reported in other studies. That is, experienced 
teachers tend to show lower (i.e. authoritarian) 
scores than is true of beginning or relatively inex- 
perienced teachers. Consequently, suchincreased 
variation alone might account for the higher cor- 
relations obtained in the experienced teacher group. 
It might also be true that the difference in correla- 
tions could be due to personality changes occurring 
with teacher experience. 

To determine which of these two conditions 
might be most logically held with these data, the 
correlations for the beginning and student teacher 
groups were corrected for restriction of range ac- 
cording to procedures recommended by Gulliksen 
(5). This procedure enables one to obtain an esti- 
mate of the correlations if these groups had the same 
MTAI variability as the experienced teacher group. 
Along with correcting the correlations for restricted 
range, GZTS-trait standard deviations for these 
Same groups were also corrected to determine how 
much additional variability there wouldbe. The re- 
sults of this step are presented in Table III. 

This table shows that the variability for the 
three groups on the GZTS-traits are still quite sim- 
ilar evenafter correctionfor restricted MTAI range. 
The corrected correlations, however, still do not 
equal those obtained with the experienced teacher 
group although they do approach these values. The 
evidence from this step in the analysis would tend to 
eliminate the factor of greater heterogeneity of MTAI 
Scores as the cause of the differences incorrelations 
and suggests instead that there may be real person” 
ality changes occurring with experience in teaching. 
If this interpretation has validity, it would not sup 
port the use of the MTAI or the GZTS as an initial 
selection device since these scales wouldnot 
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TABLE I 
MEANS AND STANDARD DEVIATIONS FOR MTAI AND GZTS TRAITS 


Variable Beginning Teachers Student Teachers In-service Teachers 
Mean S. D. Mean S.D. Mean S. D. 
N 196 117 300 
MTAI 20. 03 30.72 48.72 28.66 28.6 42.8* 
GZTS Traits 
G 17.57 4,92 15.94 5.05 15.8 5.56 
17.29 5. 08 18.15 4,64 19.9 4.06 
А 15.68 5.73 15.36 5.14 12.5 4.91 
5 21.18 5.40 21.25 5.54 19.2 5.82 
Е 17.73 5.63 18.68 5.18 19.8 5.52 
о 17.68 5.32 19.36 4.52 19.7 4.80 
F 16.20 5.61 18.36 4.69 19.5 4.78 
m 19.61 4.95 18.86 4.19 17.7 4.43 
IB 18.58 5.46 18.89 5.20 19.8 4.50 
M (Male) 19.70 4.17 18.9 3.60 
(Female) 10.56 4. 46 18. 75** 5.30 10.3 3.67 


* Figures reported only to one decimal place in personal communication to 


author. 
** Separate data for males and females not reported. 


TABLE IL 


HOMOGENEITY OF CORRELATIONS BETWEEN MTAI AND 6275 
TRAITS FOR THREE LEVELS OF TEACHING EXPERIENCE 


———— 


Trait Beginning Teachers Student Teachers Inservice Teachers 
(Cook N=196) (Ferguson N=117) (Leeds N=300) 


G - General Activity . 00 .05 . 06 

R - Restraint .13 .09 ‚06 

А - Ascendance .12 .15 .19** 

S - Sociability . 06 .16 .20** 

E - Emotional Stability ТТ, Кат ‚36** 

О - Objectivity .17 .12 44% 

Е - Friendliness .15 .18 236% 

T - Thoughtfulness .07 .09 -.07 

P - Personal Relations pale .28** 252%% 
cei Sem. 7 ШЕ еі, 


жж Correlations significant at 1 рег cent level. 
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TABLE III 


MTAI-GZTS CORRELATIONS AND GZTS TRAID STANDARD DE- 
VIATIONS CORRECTED FOR RESTRICTED RANGE OF MTAI 
SCORES IN BEGINNING AND STUDENT TEACHER GROUPS 
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Corrected SDs 
Beginning Student 


GZTS Corrected rs 
Trait Beginning Student 
G .00 207 
R .18 .13 
A «17 .22 
5 . 08 .24 
E .23 .31 
о .23 .18 
F .21 .26 
T. .10 .13 
P .29 .40 
M(Males) ‚00 ) 
M(Females) .25 y.18 


discriminate sufficiently between prospective teach- 
ers as they would at some later time. 

The results of the study have to be interpreted 
with caution since several questions need to be an- 
Swered. Thefirst, anda rather serious one, relates 
to the differences between the populations involved. 
It is certainly open to question as to whether or not 
the three discrete samples used can be considered 
as having come from the same population, particu- 
larly when two of the groups involved contain both 
Secondary and elementary teachers While the other 
one involves only prospective Secondary level teach- 
ers. For purposes of this Study, however, the as- 
sumption was made that they were. Further inves- 
tigation of this problem would require a more rigor- 
ous control over this factor. 

Another question that needs answeri ng is 
whether the increased correlations in the experienced 
teacher group are due to changes in performance on 
the MTAI, the GZTS, or both. Froma longitudinal 
point of view, evidence is also needed as to what 
point the changes, if any occur. 

A third question is concerned with the орега- 
tion of sex differences. The M-scale shows contra- 
dictory results when comparisons are made between 


4.92 5.06 
5.12 4. 66 
5.77 5.21 
5.41 5.63 
5.72 5.32 
5,41 4.56 
5.67 4.78 
4,96 4.21 
5.57 5.44 
а 


the beginning and experienced teachers. The 
MTAI-M trait correlations for experienced teache: 
Show that the male correlation was larger than 
females while an opposite condition is true for 
beginning teachers. Also, muchsmaller difference 
in correlationwere obtained with the beginning teach- 
ers than with experienced teachers. Chance 
tors, restriction of range, and the relative sma 
male samples might be advanced to explain the di 
ferences. Since it was not possible to obtain Se 
rate correlations by sex for all traits from the th 
Studies, the influence of this factor is somewh 
Speculative. The evidence from the one trait do 
however suggest an area for further research. 1 

А fourth question can be raised with regard 
the operation of age or maturity. Its operation 
only be inferred here through the three levels 
teaching experience reported. Direct investigati 
of possible changes with age onall the variables ust 
might throw additional light on the question of whe 
or not there are true Personality changes with te 
ing experience. 

The writer is currently directing a study d 
Signed to overcome the limitations in the pres е 
Study and to answer the above questions. This Stu 


consists of gathering longitudinal data on the two 
variables (MTAI and GZTS) by repeated testings of 
the same group of students as they enter the profes- 
sional sequence, at the time student teaching is 
completed, and after a minimum period of teaching 
experience. Itis hopedthat this study will give more 
definitive information on the natureof changes in 
performance on the variables and at what period 
such changes may begin to appear. 


Summary and Conclusions 


Investigation was made to compare MTAI- 
GZTS trait correlations for a group of beginning 
teachers, student teachers, and in-service teachers 
to determine if such correlations were of the same 
general magnitude and direction and could be con- 
sidered as having a common population value. 

Results showed thatthe MTAI-GZTS trait cor- 
relations for beginning and student teachers when 
compared to experienced teachers were generally 
in the same direction but smaller іп size. Тһе ex- 
perienced teachers had larger correlations for all 
of the GZTS traits except for the С, R, and T scales, 
These scales were somewhat similar for all three 
groups. The P-scale was the only trait found to be 
significantly related in all three groups. 

The interpretation of results raises questions 
which need answering before more definitive answers 
can be made. These questions revolved around the 
nature of the saniples used, changes in performance 
with experience oneitheror both of the scales used, 
а sex factor, and an age or maturity factor. 

Within the limitations of the study, the follow- 
ing conclusions are presented: 
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1. Beginning and studentteachers show sim- 
ilar relationships between MTAI scores 
and each ofthe various GZTS traits but 
thatthese relations differfrom experi- 
enced teachers primarily in that the cor- 
relations are higher for the latter group. 


2. The presence of greater heterogeneity of 
MTAI scores among experienced teachers 
cannot alone be used to explain the differ- 
ences in correlation between the groups. 


3. The increasing size of correlations with 
teaching experience suggests that there 
may be personality changes taking place 
with increased participation in profession- 
al work. 
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A Psychometric Evaluation of Experienced Teachers 


TWO OF THE MOST extensive studies regarding 
teacher characteristics and teacher behavior were 
the project of the American Council of Education 
(Ryans, 1953), and the studies of the Harvard 
Teacher Education Research Project, a summary of 
which may be found elsewhere (Levin, Hilton and 
Leiderman, 1957). Studies of nonintellectual fac - 
tors have not yielded clearly consistent results. In 
part,this may be due to the variety of measuring 
instruments which have been used and to the differ - 
ent samples which have been tested. The most 
consistent trait descriptions and interests reported 
were: cooperativeness, friendliness and interest іп 
working with people. Other studies indicated emo- 
tional stability, responsibility, interests in personal 
relations (Gowan, 1957), and a liking for quick- 
ness of action and production as well as efficiency 
(Jones, 1956). 

The purpose of the present study was to deter- 
mine the scholastic aptitude, personality traits and 
interests of experienced teachers pursuing advanced 
degrees in education through the use of standardized 
measuring instruments. Domendiffer from women 
and do subjects pursuing a master's degree differ 
Írom those pursuing a doctor's degree? 


Sample 


The sample consisted of 77 experienced teachers 
(35 men and 42 women), enrolled in graduate edu- 
cation courses atthe University of Oklahoma during 
the summer session. For each subject, data con- 
cerning his teaching experience, administrative ex- 
perience, non-teaching experience, residence and 
degree sought were obtained from a biographical 
form he completed beforetakingthetests. The age 
range was 22 to 55 years with a median age of 34 
years for men, and for women the range was 22 to 
62 years with a median age of 41 years. Teaching 
experience ranged from one to 12 years at both the 
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elementary and secondary levels with median teach- 
ing experience of 4, 5 yearsfor men, and for women 
experience ranged from one to 33 years at the ele- 
mentary level and one to 21 years at the secondary 
level with medians of eight and seven, respectively. 
The range of school administrative experience atthe 
elementary level was zero to six years, median -5, 
and at the secondary level the range was from four 
to 24 years, median -15, for men. Only one woman 
reported a year's administrative experience at the 
elementary level, noneat the secondary level. Non- 
teaching experience ranged from four to nine years 
for men and two to four years for women with me- 
dians of five and three, respectively. Themajority 
of teachers resided in Oklahoma. Twomen resided 
in Kansas and one woman each in New Mexico and 
California. Twenty men and 33 women were pursu- 
ing master's degrees in education and 15 men and 
nine women were pursuing doctor's degrees. 


Procedure 


The School and College Ability Test (SCAT-V and 
SCAT-Q) was administered to the subjects during 
two separatesessionson the same day. The Minne- 
sota Teacher Attitude Inventory (MTAI), Guilford- 
Zimmerman Temperament Survey (G-Z), Allport- 
Vernon Study of Values, revised, Strong Vocational 
Interest Blank for men (only the nonoccupational 
Scales were used: Occupational Level - OL, Inter- 
est Maturity - IM, Masculinity-Femininity - MF, 
and Specialization Level — SL), and the Kuder 
Preference Record, Vocational Form CM, were 
given to the subjects to complete at leisure. These 
tests were selected because of their widespread use 
in teacher-training institutions, all were untimed 
except the scholastic aptitude test, and the availa- 
bility of reliable normative data. 

For comparative purposes, all obtained raw 
scores were converted to T scores except the re- 
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TABLEI 


MEANS AND STANDARD DEVIATIONS OF TEST SCORES 


Test Men Women Master Doctor 
(N-35) (N=42) (N-53) (N=24) 
Scale 
Mean S.D. Mean S.D. CR Mean 8.0. Mean S.D. CR 

SCAT (V) 57.5 8.8 57.4 7.3 57.3 8.4 57.9 7.6 
5САТ (0) 51.3 7.8 54.2 8.9 51.6 8.6 54.8 7.4 
MTAI 47.0 11.6 42.3 7.8 2.02* 44.0 11.2 46.8 TEE 
Strong OL 52.0 7.3 51.8 7.5 50.7 8.1 54.7 4.4 2.79** 


MF 44.6 3.6 38.7 4.4 6.34** 41.1 5.1 42.1 4.5 
SL 41.8 9.0 40.7 7.2 40.1 8.9 42.3 6.4 


IM 49.3 3.4 51.2 2.3 2.79** 50.2 3.0 50.1 3.2 


G-Z G 51.0 8.6 50.7 10.7 50.3 9.0 51.9 10.7 
R 57.7 9.5 55.6 8.0 56.9 7.8 56.4 11.1 
A 50.3 10.0 49.9 9.9 49.6 9.4 51.4 11.1 
5 42.5 4.3 41.0 3.4 41.8 4.1 41.8 3.8 
Е 52.5 8.6 54.5 10.7 52.6 10.0 55.3 8.5 
о 54.1 9.8 56.5 9.4 54.5 10.5 56.6 7.5 
Е 55.6 8.6 56.6 10.3 56.2 9.3 55.7 9.7 
T 53.4 8.9 52.7 9.6 52.9 8.7 53.6 10.4 
P 58.2 7.9 57.6 9.7 57.4 9.3 59.1 7.3 
м 51.9 10.9 46.2 7.8 2.63%% 48.9 8.9 48.6 11.3 
Kuder ош 51.1 11.6 49.5 10.7 52.1 11.8 46.6 8,6 2,68% 
Mech 47.1 10.4 43.4 8.6 48.1 9.7 39.5 6.9 4.35% 
Comp 48.4 12.0 47.2 10.2 46.8 11.5 50.0 10.3 
Scien 41.5 9.9 46.7 9.1 46.2 7.9 49.1 12.3 
Регв 53.5 11.4 50.1 7.4 51.5 11.0 53.0 6.8 
Art 52.5 9.0 48.8 12.8 52.8 11.2 46. 6 9.2 2.50* 
Lit 52.7 10.2 53.4 11.5 51.5 10.8 56.4 10.2 
Mus 50.2 7.5 52.3 12.4 50.1 10.4 53.6 8.8 
So Serv 55.5 8.8 58.5 8.7 56.3 9.5 58.0 7.4 
Cler 45.2 10.1 46.1 10.6 45.0 9.3 41.0 12.2 
Study of Th 39.0 6.7 42.1 6.1 2,11% 39.7 6.9 41.9 5.6 
ir Ec 36.8 5.9 39.5 6.0 1.96* 38.5 5.8 31.0 6.3 


Ae 39.8 7.3 34.3 8.2 3.07** 36.7 7.5 38.6 9.4 


So 38.3 6.0 31.3 5.3 38.0 6.0 31.4 5.0 


Po 37.0 5.6 39.1 7.2 87.7 6.4 38.5 6.3 


5e 2:49:19 2а нв 7:9 йз та 4667 те 


ж Significant ас. 05 level 


** Significant at . 01 level 
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sults from the Study of Values, where raw scores 
were used. 


Results 


Group means andstandarddeviations for each 
test variableby sex and degree soughtare presented 
in Table I, together with the critical ratios which 
were applied to test the significance of the differ- 
ences between the means. Theomission of critical 
ratios for specific variables indicates that no sta- 
tistically significant differences were obtained. 
When comparisons were made between men and 
women, men scored significantly higher on the 
MTAI and aesthetic scale of the Study of Values. It 
is evident that the attitudes of these men toward 
getting along with pupils and their interpersonal 
relationships were better than for women and that 
the men were more satisfied toward teaching as a 
vocation. The women were less aesthetic, but 
more theoretical and economical than the men. The 
significant differences on the masculinity and fem- 
ininity scales were expected because of the differ- 
ence in nature of interests and activities for men 
and women in our society. The roleof interest 
maturity was greater for women which suggests that 
they might be characterized as being more socially 
sensitive, more tolerant, more insightful and better 
organized. Both groups averaged less than the 
average IM score equal to the mean score of 30- 
year-olds (Strong, 1943, p. 263). 

When mean comparisons were made between the 
groups of subjects pursuing a master's degree and 
doctor's degree, the doctoral students scored sig- 
nificantly higher on the OL scale by four standard- 
Score points, Darley (1941, p. 60) suggested that 
OL is a quantitative statement of level of aspiration, 
а measure of motivation, and implied that it could 
be an indication of socio-economiclevel. It is gen- 
erally accepted that teachers are classified at the 
middle socio-economic level (Burton, 1953, p. 248), 
and the mean OL scores obtained here substantiate 
this generality. The only other significant differ- 
ences were found on the Kuder Preference Record, 
where the master's degree group were more inter - 
ested in outdoor activities and in the mechanical-and 
artistic areas. 


Discussion 


In general, the results of this study support the 
findings obtained from previous studies which were 
cited. Although no tests of significance were deter- 
mined among test variables and no dominant pat- 
terns existed for the groups, the highest mean 
scores suggest the following characteristics for 
this sample of experienced teachers. As a group, 
they showed greater ability in working with verbal 
material than with quantitative, more restraint, 
greater adeptness in personal relations, interests 


toward social service and relative importance upon 
moral and ethical standards. Since none of the ob- 
tained mean T scores exceed plus and minus one 
standard deviation, it could be implied that experi- 
enced teachers are not essentially different from 
people-in-general as measured by the standardized 
tests used in this study. The behavior traits and 
interest patterns could have become neu tralized 
since the subjects tested were relatively heteroge- 
neous with respect to interests in various subject- 
matter areas and in specialized fields. Also, even 
though good rapport was established with the sub- 
jects at time of testing, many of the teachers ap- 
peared skeptical about taking tests which could ac- 
count for the lack of any dominant characteristic 
patterns emerging. Therefore, it would seem nec- 
essary to conclude that until the skepticism on the 
part of teachers and their attitudes toward being 
tested can be removed, a clear-cut description of 
their measured characteristics cannot be accurate - 
ly determined. 


Summary 


Measures of scholastic aptitude, personality 
traits and interests were obtained from 77 experi- 
enced teachers enrolled in graduate education 
courses at the University of Oklahoma. The mean 
Scores were significantly higher for men on the 
MTAI, MF scales of the SVIB (M) and G - Z, and 
aesthetic scale on the Study of Values. Women 
scored significantly higher on the IM scale ofthe 
SVIB (M) and the theoretical and economical scales 
on the Study of Values. Mean comparisons between 
the master’s and doctor's groups showed a signifi- 
cantly higher mean score on the OL scale of the 
SVIB (M) in favor of the doctor's group, while the 
master's group means were significantly higher in 
the outdoor, mechanical and artistic areas on the 
Kuder Preference Record. Significant differences 
were not obtained between groups among other var- 
iables. Since no dominant characteristic patterns 
emerged from the results of standardized tests, it 
was concluded that experienced teachers were not 
essentially different from people-in-general. 
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FIELD NEWS continued.... 


1 
JOHN C. WEAVER, VICE PRESIDENT FOR RESEARCH AND DEAN OF THE GRADU- 
ATE COLLEGE AT THE STATE UNIVERSITY OF IOWA states that there 
should be a sharp increase of federal un-earmarked, non-project grants for 
higher education which can be spent as a university deems wise within its own 
operations. He points out that during the year, 1960-61, unive rsities spent 
about $150, 000, 000 for their own research activities, while business corpo- 
| rations and private foundations contributed an additional $100,000,000. The 
| federal government during the year spent $879, 000, 000 in direct support of 
research on the campuses of the nation. A realistic fact of the matter is that 
a very considerable part of these funds are being spent by a multitude of un- 
| coordinated branches of government, eac h seeking to accomplish its special 
| objectives through the use of the higher educational establishment. 
| 
| 
| 
l 
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CHOLARSHIP reports that one-fifth of the 
k in the top third on graduation and who 
llege do not enroll in college. Of these, 
nd forty-four percent of the women say 
th schooling because of lack of money. 
lling toborrow, whilethe others 
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high-school students who гап! 
: have indicated interest in co 
sixty-eight percent of the mena 
| that they did not continue wi 
About one-half said they would be wi 
objected to going into debt. 
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THE NATIONAL SCHOOL BOARDS ASSOCIATION is asking school board members 
throughout the country about their stand on the question of federal aid to edu- 
havebeendistributed. More than 10,000 


cation. Nearly 100,000 opinionnaires 5 
school boards are included. The NSBA estimates that this will represent about 


ninety-five percent of the nation's enrollment in public elementary and secon- 
dary schools. The results of this sur vey are to be released at the National’ 
Convention to be held in St. Louis, April 12-14. The NSBA is attempting to 


find out answers to questions such as, whether a particular district receives 


federal aid for vocational education, for school lunches, for National Defense 
Education Act р 


rojects, for federal aid to impacted areas; and whether the 
board member thinks that federal funds for each of these purposes should be 
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An Experiment in the Teaching of General Psychology 


Introduction 


In the teaching of general psychology, the most 
Írequent practice in Americanuniversities for many 
years has been to offer aone semester, three credit 
course. Usually, under this plan, students hear 
three lecture discussions each week and read one 
general textbook in psychology. What was assessed 
in the present experiment was the effects of chang- 
ing both the number of formal instructor - student 
contact hours and the amount of required reading. 


Procedure 


Six sections of general psychology (four in the 
fall semester and two in the spring semester, 1959- 
1960), constituted the study groups. Altogether, 194 
Students were involved. Each semester half of the 
participating sections were designated as Control 
groups and the other half, Experimental groups. 

The Control groups were taught in the conven— 
tional manner described above. That is, the in- 
Structor conducted lecture/discussion meetings for 
three 50-minute periods each week; and each stu- 
dent read one widely adopted textbook in general 
psychologyl. Eight bi-weekly examinations were 
administered, followed atthe close of the semester 
by à comprehensive objective examination covering 
basic psychological principles which had been em- 
phasized throughout the entire course. It was the 
relative performance of the two groups on this lat- 
ter examination that formed the principal cri terion 
of the effectiveness ofthetwo instructional methods. 

Students in the Experimental sections were as - 

signed the same basic textbook in general psycho- 
logy as was the case for the Control students. In ad- 
dition, they were required to read two volumes of 
selected readings in general psychology2. For the 
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most part, these volumes consisted of collections 
of either current or classic research in the field of 
general psychology. A course outline was furnished 
which articulated the outside readings with the 
chapters in the basic textbook. 

Class meetings with Experimental sections were 
held once each week. During this time, lecture/ 
discussions centering upon the current topic of the 
course were carried on. Eight bi-weekly examina- 
tions were administered with both the examination 
dates and topics coinciding with the testing schedule 
for the Control sections. The meetings of Experi- 
mental students for purposes of examination were in 
addition to the once-weekly lecture discussion pe- 
riods. The final comprehensive examination, re- 
ferred to earlier, was identical in all respects to 
that administered to the Control groups. 


Results 


The results of this study revolve mainly around 
the scores obtained by the two groups on the final 
examination. This examination consisted of 150 
carefully selected multiple choice items. Each item 
was given a weight of three for purposes of grading, 
making it possible for a student to earn 450 points 
on the final. 

Because of considerable disparity in size among 
the six sections, no analysis of individual sections 
was made. Instead, the scores of all Control stu- 
dents were combined, as were all the Experimental 
scores. Table I shows the overall comparison of the 
two groups. From this table it is apparent that the 
performance of the Experimental group was signifi- 
cantly poorer than that of the conventionally taught 
Control group. 

Because the comparison of the two groups as pre - 
sented in Table I is Statistically significant, a point 


T Ruch, Floyd L. Psychology and Life. Brief 5th Edition, Scott, Foresman and Company, 1959. 
2 Dulany,D.E. et al. Contributions to Modern Psychology. Oxford Press, 1958. 


Hartley, Е.І. and Hartley, Ruth E. Outside Readings in Psychology. 2nd Edition, T. Y. Crowell Company 1958. 
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to be discussed more fully in the next section of this 
paper, it raises the following question: “15 this dif- 
ference attributable to the different teaching methods 
or is it due to an initial superiority in scholastic 
ability of the higher group?" То resolve this issue 
the cumulative grade point average of all participat- 
ing students was secured from the Registrar’s of - 
fice, and a comparison made between the Control 
and Experimental groups. Table II summarizes 
that comparison. 

Because data presented in Table II permit the 
conclusion that the two groups were not significant- 
ly different in scholastic ability, three additional 
comparisons were made. Following the observation 
that the two groups performed differently on the fi- 
nal examination criterion, the question arose as to 
whether this would hold true if comparisons were 
made among: (a)the scholastically superior students 
in the two groups; (b)the scholastically average stu- 
dents; and (c) the scholastically weak students. Ac- 
cordingly, the final examination scores of the Ex- 
perimental and Control students who had grade point 
averages of one sigma ormore above ће mean(i. e. , 
the superior students) were compared; a second 
comparison was made of th e final examination 
scores of Control and Experimental students with 
grade point averages within one-half sigma of the 
mean (i.e., the average students); and а third com- 
parison was made between the final examination 
scores of Control and Experimental students with 
grade point averages of one sigma or more below 
the mean (i. e. , the weak students). Table III sum- 
marizes these data. 

It is apparent from Table III that the differences 
for all three groups are consistently in favor of the 
Control students who were taught in the convention- 
al manner. Moreover, the magnitude of the differ- 
ences is roughly the same in each comparison. 
While the 20 point difference in means for superior 
students fell just short of statistical significance, a 
similar difference in the case of average students 
was significant at well beyond the .05 level of con- 
fidence. It should be noted, however, that the num- 
ber of students in the superior group was quite 
Small. The difference in means for the two groups 
of weak students,though larger than the other two, 
fell considerably short of statistical significance . 
This, too, was partly a function of the small num- 
ber of weak students involved. 

Before turning to a discussion of the data, o ne 
final observation should be reported. Because stu- 
dents have a large measure of freedom to withdraw 
from courses, itwas pertinentto determine whether 
the proportion of withdrawals differed significantly 
under the two instructional methods. Accordingly, 
а standard error of a proportion (i. 6.» Sigmay was 
computed on withdrawals. The difference inthe pro- 
portion of withdrawals Was found to be of no statis- 
tical significance. 


Discussion 


All of the comparisons shown in Tables I, II and 
ШІ present tests of significance of differences in 
means of the two groups. In TableI are found the 
results from comparing final examination scores of 
Control and Experimental students. The difference 
in means, favoring the Control group, is significant 
at beyond the . 05 level of confidence. Sinceit is 
known from the data presented in Table II that the 
two groups did not differ significantly in general 
academic proficiency, it may be concluded that the 
conventional method of instruction resulted in great- 
er gains for the Control students than did the Exper- 
imental method for the Experimental students. The 
latter required students to seek out much of the 
basic information through selected readings. 

When only the superior students of the two groups 
were compared, as shown in Table III, there was no 
significant difference in means, although, as in the 
other comparisons, the Control subjects had, on the 
average, higher scores. Very nearly the samefind- 
ings were recorded for weak students, except that 
the statistical test fell further from significance. 
When scholastically average students were com- 
pared, the difference emerged as quite significant, 
falling just short of the .01 level of confidence. 

These findings hold a number of important impli- 
cations. One of the first to arise suggests that the 
classroom instructor cannot,forthe majority of stu- 
dents, be replaced simply bythe introduction of se- 
lected readings. Betterlearning and retention seems 
to be clearly related tofrequent instructor-contacts. 
This, while true in general, is especially true for 
the scholastically average student. The superior 
student, on the other hand, while he tends to perform 
better when meeting often with his instructoroman- 
ages to achieve nearly as much through independent 
reading. The scholastically weakstudent, however, 
retains his borderline proficiency under either in- 
structional method. 

The results of this study lend strength to a num- 
ber of recommendations. A possible one would be 
to group students in classes suchas general psychol- 
ogy according to academic ability. This, of course, 
would be a step toward the honors program which 
has excited so much discussionduring the past year. 
These select classes, withless time devoted to lec- 
turing andagreater emphasis upon independent read- 
ing, and with the added motivation accruing from be- 
ing nominated as honor students, might surpass the 
performance obtained when grouped conventionally 
as in the present study. 

The second recommendation grows out of the ob- 
servation that average students demonstrated a con- 
siderably greater proficiency as their instructor con 
tacts increased. It would seem, for these students , 
that any change which increased their contacts with 
instructors would prove beneficial. The greater in- 
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TABLEI 


COMPARISON OF FINAL EXAMINATION SCORES 
OF CONTROL AND EXPERIMENTAL STUDENTS 


Control Experimental 
x. N 85 76 
Mean 283.5 266. 3 
Sigma 49.9 49.4 
Difference in Means = 17. 2 
5 сао EC E = .05 
TABLE II 


COMPARISON OF CUMULATIVE GRADE POINT AVERAGES 
OF CONTROL AND EXPERIMENTAL STUDENTS 


————————————ÓÉÓÉÓÉÁÉÓÉÓÁÉÓÉÉÉÉ 
_——Г 


Control Experimental 
SE AEE RS RSS quic Pardi grin P NN 
N 85 76 
Mean 2.24 2.28 
Sigma 0. 69 0. 65 
Difference in Means - 0.04 
ақа Ioco M НАК - nil 


ee 


TABLE III 
COMPARISON OF MEAN FINAL EXAMINATIO! 


IN SCORES OF SUPERIOR, 
AVERAGE AND WEAK STUDENTS IN CONTROL AND EXPERIMENTAL GROUPS 
Control Experimental Difference P 
Superior 345 325 20 2.05 
Students (8-11) (№9) 
Average 278 258 20 <.05 
students (N=34) (N=33) 
Weak 242 216 26 nil 
students (N=11) (N=5) 
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structor contact could be accomplished ina number 
of ways. Reduction in the size of classes, for ex- 
ample, should enable instructors to deal more di - 
rectly with individual students during the lecture 
hours. Such a reduction in class size would be par- 
tially achieved by the removal of superior students 
to fill the honors sections suggested above. Another 
means of increasing instructor-student contact would 
be the introduction of the quiz section method in 
which small groups of students meet frequently with 
teaching fellows or graduate assistants. Because 
this and the foregoing recommendation regarding 
honors sections are essentially complementary,both 
might be followed without increased demands of 
space or instructor time. 

The above recommendations are, of course, ten- 
tative, pending further study. A number of prob- 
lems prevent drawing firm conclusions from the 
present investigation. One of these concerns the 
problem of full articulation of the selected reading s 
used in the Experimental sections with material 
contained in the basic textbook. Since the Control 
students were not required to read the former ma- 
terial, direct questions could not be asked of both 
groups on this subject matter. The final examina- 
tion used as a basis of comparison dealt most di - 
rectly with material contained inthe basic textbook. 
It is possible that the Experimental students,be- 
cause of their additional reading, gained a broader 
knowledge of psychology than did Control students. 
Of course, muchof ће selected reading overlapped 
with the content of the basic textbook, and it was 
hoped that it would supplement and strengthen stu- 
dent understanding of the fundamental princ iples 
presented in the basic text. 


Certain other changes in the procedure might 
possibly produce different results in an otherwise 
similar experiment. Special techniques might be 
introduced to improve the guidance of Experimental 
students in their extra reading, sothatit would be- 
come more relevantto the basic principles of gen - 
eral psychology. One such technique would be the 
use of study questions, which direct reading in a 
more selective and emphatic fashion. 

Another factor that might have entered into the 
Obtained results was the unfamiliarity of the experi- 
mental method employed. Students accustomed to 
meeting three hours a week in athree credit cours e 
simply might not be able, in the short span of one 
semester, to assumethe responsibility and develop 
the study habits which are requiredfor the Experi- 
mental method. Onthe other hand, if such a method 
were to be used ona wide scale, sufficient practi ce 
with it on the part of students mightresult in supe- 
rior performance. A closely related question con - 
cerns the extent to which students in the Experi- 
mental groups actually read the selected readings. 
Even though they were quizzed on it, they may not 
have spent sufficient time to make the resulting gain 
comparable to that enjoyed due to the additional 
class meetings experienced by students taught in 
the conventional manner. 

As is true of other kinds of inquiry, this type of 
research may well raise as many questions as it re- 
solves. The foregoing are some of the problems 
issuing from the present research. It is hoped, 
however, that this study serves the dual purpose of 
providing some basis for current instruct ional 
practices as well as suggesting the direction of ad- 
ditional investigations. 
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Differential Need Patterns of Women Elementary and 


Secondary Level Student Teachers 


VOCATIONAL PSYCHOLOGISTS agree that 
need structure is an important determinant of vo- 
cational choice. This pointof viewisexpressed by 
Super (1957), Roe (1956), Hoppock (1957), and oth- 
ers. Hoppock states that the occupation which one 
chooses is the one which he believes will best meet 
the needs that most concern him. Needs m ay be 
either intellectually perceived or may be only 
vaguely felt attractions. In either case they tend 
to influence occupational choice, 

Several recent studies have attempted to deter- 
mine the need patterns of elementary and secondary 
teachers by means of the Edwards Personal Pref- 
erence Schedule (1952). This instrument is a 225 
item forced-choice inventory which purports to 
measure fiften of Murray's (1938) manifest needs. 

Jackson and Guba (1957) administered the Ed- 
wards Personal Preference Schedule to 196 female 
elementary teachers and fifty-two female secondary 
teachers in public schools of the Chicago suburban 
area. They found that the elementary teachers 
scored significantly higher than the norm group on 
deference, order and endurance and significantly 
lower on exhibition, autonomy, dominance and het- 
erosexuality. Female secondary school teachers 
scored significantly higher than the norm group on 
deference, order and achievementand significantly 
lower on exhibition, dominance, change and heter- 
osexuality. 

Tobin (1956), who administered the Edwards 
Personal Preference Schedule to thirty-seven fe- 
male elementary and secondary teachers in the 
Puget Sound area of Washington state, reports that 
teachers scored significantly higher than the norm 
group on deference, order andendurance, and sig- 
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nificantly lower on dominance, heterosexuality and 
aggression. 

In a study of 100 physical educationte achers 
Thorpe (1958), found that her group scored signifi - 
cantly higher than the norm group on deference, 
order, exhibition, dominance and endurance, and 
significantly lower on autonomy, abasement, 
change and heterosexuality. 

АП of the investigators cited have found that in- 
Service teachers score significantly higher than the 
norm group on deference, order and endurance and 
significantly lower on heterosexuality. With the 
exception of the physical education teachers, 
inservice teachers were also foundto score signif- 
icantly lower than the norm group on dominance. 

Only two studies have investigated the need pat- 
terns of women students preparing for the teaching 
profession. In neither study were need scores of 
Students preparing for elementary teaching com - 
pared with those of students preparing to teach at 
the secondary level. Tobin's (1956) sample in- 
cluded students preparing forteaching at both les 
els. Presumably most of the students in Thorpe's 
(1958) sample were preparing to teach at the sec- 
ondary level. д 

Tobin (1956) found that the eighty students in his 
sample scored significantly higher than the norm 
group on deference, order and endurance and sig- 
nificantly lower on dominance, heterosexuality and 
aggression. This was the same pattern which To- 
bin found for his inservice sample. 

Thorpe (1958) found that the 100 students in her 
sample scored significantly higher than the norm 
group on endurance and significantly lower on au- 
tonomy, succorance and aggression. 


SCANDRETTE 377 


This is aconsiderably different need pattern from 
that which was found for inservice physical educa - 
tion teachers. With the exception of a significantly 
higher score on endurance and a significantly lower 
score on aggression, the finding of Thorpe's study 
differed considerably from the findings of the Tobin 
(1956) study. 


Purpose 


The purpose of the present study was to determine 
whether scores onthe various need variables of the 
Edwards Personal Preference Schedule would sta- 
tistically discriminate between (a) students prepar- 
ing for teaching at the elementary 1 evel and Ed- 
ward's (1952) liberal arts norm group; (b) students 
preparing for teaching atthe secondary leveland the 
norm group, and (c) students preparing for teaching 
at the elementary level and students preparing for 
teaching on the secondary level. No previously re- 
ported study has attempted to compare Edwards 
Personal Preference Schedule scores of students 
preparing for teaching at the elementary and sec- 
ondary levels. 

Because elementary and secondary teaching dif- 
fers with respect to the age of students, the nature 
of the subject matter, teaching techniques and sex 
composition of the teaching force, it was hypothe- 
sized that students who had chosen elementary 
teaching would have different n eed patterns th an 
those who had chosen secondary teaching. 


Procedure 


Subjects were women elementary level and sec- 
ondary level student teachers at Wheaton C olleg e, 
Wheaton, Illinois. The sam ple consisted of all 
women students doing student teaching during three 
semesters. Seventy-three students taught at the 
elementary level and eighty-nine at the secondary 
level. Members of both groups completed the Ed- 
wards Personal Preference Schedule at student 
teaching seminar meetings. Mean scores for each 
of the fifteen “пеебв” purportedly measured by Ed- 
wards Personal Preference Schedule were computed 
for both groups. The “t? ratio was used totes t 
the significance of the differences (a) between the 
means of the two samplegroups and thenorm group 
and (b) between the means of the two sample groups. 


Results 


Findings are summarized in Table I. Women 
elementary level student teachers made signifi- 
cantly higher mean scores than the norm group on 
deference, order and affiliation and significantly 
lower mean scores on autonomy and aggression. 

Women student teachers on the secondary level 
made significantly higher scores than the norm group 
on order and dominance and significantly lower 


Scores on affiliation and succorance. 

Significant differences between the mean scores 
of elementary and secondary student teachers were 
found for autonomy, affiliation, dominance and ag- 
gression. Secondary level student teachers were 
significantly more autonomous, dominant and ag- 
gressive; elementary teachers were more affilia - 
tive. 

Comparison of mean need scores in Table I indi- 
cates that students preparing for secondary school 
teaching have scores more similar totheliberal 
arts norm group than is true of students preparing 
for teaching on the elementary level. 


Discussion 


The present study indicates that women students 
preparing for elementary teaching have significantly 
different need patterns than students preparing for 
teaching at the secondary level. Discrepancies be- 
tween need patterns of women student teachers found 
in previous studies may be explained by the fact that 
students from one level have been compared with 
students from both levels lumped together. 

Students preparing to teach non - academic sub- 
jects, such as physical education, may. also have 
somewhat different need patterns than those prepar- 
ing to teach academic subjects. 

Differences in need patterns of student teachers 
and inservice teachers can probably be accounted for 
by differences in age, experience and selective at- 
trition. Teachers with three or fewer years of teach- 
ing experience іп the Jackson and Guba study (1957), 
had need patterns more similar to those of student 
teachers in the present study than was true of their 
veteran teacher group. 

If the need patterns of elementary and secondary 
level student teachers found in the present study 
should prove to be generally true, and if these pat- 
terns are significantly related to successful teaching 
at the respective levels, the Edwards Personal Pref- 
erence Schedule may prove tobe a useful instrument 
for screening and counseling. Astudy is now under- 
way to determine whether students who have the 
typical need patterns receive significantly higher 
student teaching ratings than those who do not. 


Summary 


The purpose of this study was to determine 
whether women students preparing to teach on the 
elementary level have different need patterns than 
students preparing to teach on the secondary level. 
Seventy-three student teachers preparing to teach on 
the elementary. level and eighty- nine students pre- 
paring to teach at the secondary level completed the 
Edwards Personal Preference Schedule. Mean 
scores for the two groups on each of the fifteen needs 
purportedly measured by this instrument were com- 
pared. Statistically significant differences between 
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TABLEI 


MEAN SCORES OF ELEMENTARY LEVEL STUDENT TEACHERS, 
SECONDARY LEVEL STUDENT TEACHERS AND THE NORM GROUP 
ON VARIABLES OF THE EDWARDS PERSONAL PREFERENCE SCHEDULE 


Need Variable Elementary Secondary Norm Group 
(N = 73) (N = 89) (N = 749) 

Achievement 12.25 13.15 13.08 
Deference 14.14 (a) 12. 64 (c) 12.40 
Order 12.75 (b) 11.54 (а) 10.24 
Exhibition 13.93 14.19 14.28 

€ Autonomy 11.16 (b) 12. 64 (c) 12.29 
Affiliation 18. 33 (a) 16. 33 (d) 17.40 
Intraception 17.19 18.30 17.32 
Succorance 12.16 10. 84 (b) 12.53 
Dominance 13.81 15. 36 (a, c) 14.18 
Abasement 14.97 14. 55 15.11 
Nurturance 16.60 15.73 16.42 
Change 17.40 17.81 17.20 
Endurance 13.20 13.13 12.63 
Heterosexuality 13.45 13.51 14.34 
Aggression 8.85 (b) 10. 17 (c) 10.59 


(a) Differs significantly from norm group, Р= < .05 
(b) Differs significantly from norm group, P- <.01 
(c) Differs significantly from elementary group, P =<.05 
(d) Differs significantly from elementary group, P =<.01 
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the mean scores of elementary and secondary stu- 2. 
dent teachers were found for autonomy, affiliation, 

dominance and aggression. Generally speaking, the 3. 
need patterns of students preparing to teach onthe 
secondary level were more similar to Edwards’ 
liberal arts norm group than was true of students 
preparing to teachon the elementary level. Differ - 
ential need patterns of elementary and secondary 
student teachers were suggested as a possible expla- 5. 
nation of discrepancies between the need patterns 

of student teachers found in previous studies. The 6. 
possibility of using the Edwards Personal Prefer- 

ence Schedule as a screening instrument was sug- T. 
gested. 
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GINN AND COMPANY OF BOSTON, a leading publisher of school text- 
ndy Organization of Detroit, producer of audio- 


books, and the Jam На! 
visual materials, jointly are producing film strips for elementary and 
high-school classes in all subject areas. These film strips will have 


ial supervision to provide visual reinforcement, prac- 
additional practice material, and functional review 
of textbook and visual materials 
gram for teachers and 


competent editor: 
tical applications, 
for the textbooks. This correlation 
should offer a balanced instructional proj 
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Readers are invited to send items of research interest to 
John Schmid, Professor, College of Education, University 
of Arkansas, Fayetteville, Arkansas. 
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Attitude Change of Elementary School Student Teachers 


And Anxiety 


ATTITUDES of teachers toward children are 
closely associated with pupil-teacher relations found 
in classrooms. While considerable study has been 
made of teacher attitudes, correlates of these atti- 
tudes with the anxiety levels of the teachers have 
not been explored. This study deals with manifest 
anxiety as a variable in attitude change during stu- 
dent teaching, serving as a preliminary step in the 
study of the effects of anxiety upon the performance 
of student teachers. 


The Problem 
This study was focused upon three hypotheses: 

l. Student teachers possessing strong anxi- 
eties may be expected to change their at- 
titudes in the direction of their Supervis- 
ing teacher's attitude. Such a change 
might be accounted for in terms of too 
ready conditioning of fear responses, a 
view supported by Eysenck (4) Franks 
(6) and Spence (8). 

2. Highly anxious studentteachers will 
perceive their pupils negatively and 
their supervising teacher as negative 
toward children. This hypothesis rests 
upon the clinical belief that stress is as— 
sociated with feelings of less favorable 
attitudes toward self and others. 

3. Student teachers, whose initial attitude 
scores are lower than those of the Super- 
vising teacher, will change inamore 
positive direction. Findings ofScott and 
Brinkley (7) support this basic assump- 
tion. 


ubjects and Procedure 
The Minnesota Teacher Attitude Inventory 
MT Al) (3), was given to 91 elementary school stu- 
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dent teachers enrolled in the University of Califor- 
nia, Los Angeles, upon entry and at the end of a 
semester of student teaching. These students taught 
one-half day five days each week for one semester. 

The Pittsburgh revision of the Taylor Manifest 
Anxiety Scale (MAS) and the Anxiety Differential (1) 
were used to secure anxietyscores for these stu- 
dent teachers. Student teachers were asked to de- 
scribe the attitudes of their individual supervising 
teachers at the completion of their teaching assign- 
ment. This was done by asking them to check the 
MTAI the way they believed their supervising teach- 
ers would respond to this attitude scale. Supervi- 
Sors, without any collaboration with student teach 
ers, and unaware of the students' d es c riptions, 
completed the inventory. Letter grades, a detailed 
characterization, and a check list of teaching skills 
were prepared by supervising teachers for each 
Student teacher. A sample group of 150 candidates 
for teaching who were not in student teaching 
responded to the MTAI at the beginning and end of 
the semester. 


Findings 4 

The results of this study will be considered within 
the following headings: (1) correlation between in- 
Struments; (2) differences between highly anxious 
and non-anxious teachers; (3) change associat ed 
With student teaching. 


Correlation between instruments. Student teach- 
ers were differentiated by using the Pittsburgh re- 
vision of the MAS and the Anxiety Differential. The 
relationship between scores made on these two in- 
struments (r=. 28), was significantat the . 001 level. 


Differences between highly anxious and non- 
anxious teachers. Correlates between change 10 
anxious teachers: 
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attitude for high and low scores onthe anxiety scale 
are shown in Table I. Note that both highly anxious 
(high 25% taking the MAS), andthe non-anxious (low 
25% taking the MAS), elementary school student 
teachers changed their attitudes toward youth іп а 
negative direction. The number who changed, 80%, 
and the amount of change was approximately the 
same for both groups. 

While there were a few students who made ex- 
treme changes, the majority of the elementary 
school student teachers hada change in score on the 
on the MTAI (after completing student teaching) of 
nine or more points. The overall magnitude of 
change was not significantly greater between stu- 
dents who were highly anxious and those who were 
non-anxious. Neither was there a notable relation 
between accuracy in perception of the supervisor' s 
attitude and anxiety level of the student respondent. 
Highly anxious student teachers perceived their in- 
dividual supervising teacher as holding negative at- 
titudes toward youth. Of this group (top 25 highly 
anxious), ten perceived their individual supervising 
teacher as positive but much below the actual level 
expressed by these teachers, and five perceived 
their supervising teacher as positive and higher 
than the supervising teacher's own rating. Non- 
anxious student teachers perceived their individual 
supervising teacher as less negative than did the 
highly anxious: four perceived their supervisi ng 
teacher as highly negative toward children, thirteen 
perceived their supervising teacher as positive but 
below the actual level expressed by the supervising 
teacher, while eight perceived th eir supervising 
teacher as more positive toward children than the 
supervisors actually were. 

Anxious student teachers, who were initially 
positive in their attitudes, changed in the direction 
of both the attitudes independently reported as de- 
sirable by their supervising teachers and theatti — 
tudes student teachers perceived tobe those of their 
supervising teachers. The three student teachers 
who were initially negative in their attitudes likewise 
changed in the direction of their supervising teach- 
ers' attitudes. Among non-anxious student teach- 
ers, however, slightly more than half (13 out of 25), 
changed their attitudes in the direction of the atti- 
tudes independently reported as desirable by their 
supervising teachers. This same group did change 
their attitudes toward the perceived attitudes they 
had given their supervisors after completing Stu- 
dent teaching. 


Change as associated with student teaching. Pre- 
vious studies show thatthe attitudes of the teachers 
are likely to improve during t he student teaching 
period, while t h e experience of regularteachi ng 
(full time with pay), may decrease MTAlscores. In 
the present study, significant changes occurredin а 
negative directionto ward children on the MTAI (to- 
tal score), during student teaching. In 22 percent 


of the cases the scores showed a positive gain, but 
in 78 percent of the sampling, changes were in a 
negative direction. The range of negative change 
was from 1 to 32 points, with a median of 9.7, 0, 
of 3. 75, and ©; of 15.06. Noteworthy is the finding 
that this change was in the direction of the attitudes 
held by the supervising teachers. Both highly anx - 
ious and non-anxious student teachers had negative 
changes in attitudes in the direction of their super- 
vising teachers. 

Scott and Brinkley (7) reporteda slight degree of 
association between attitude changes of student 
teachers and the attitudes of their supervisingteach- 
ers. The relationship was restricted to instances 
in which the student teachers made lower initial 
Scores than their supervisingteachers. The change 
in attitudes was reported to be in a more positive 
direction. In this study, 520fthe 91 student teach - 
ers had higher scores on the MTAI than their super- 
vising teachers, 4 had the same scores, and 35 had 
lower scores than their supervising teachers. Of 
these 35 student teachers with initial scores on the 
MTAI lower than their supervising teacher, there 
were 21 who changed in a negative rather th an a 
positive direction. Thus the findings of this study 
show attitude change of student teachers inthe direc- 
tion of supervising teachers regardless of initial 
Scores. 5 

Students (іп the sample group of 150), who were 
not in student teaching responding to the MTAIat the 
beginning and end of the semester maintained hi g h 
positive scores toward pupils. The scores did not 
change measurably during the semester they were 
in methods courses. 


Conclusions 

Student teachers can be differentiated by using 
existing measures ofanxiety. These differences are 
related to their influencability and to attitudinal 
changes toward youth. There was some support given 
to the hypothesis that the highly anxious student 
teacher who expresses negative attitudes will tend 
to perceive pupils negatively as he works with them 
during student teaching. Within (һе limits of this 
study, the ratings of supervising teachers donot in- 
dicate that high anxiety has a detrimental effect upon 
student teaching. 

Both highly anxious and non-anxious elementar y 
School student teachers changed their attitudes 
toward youth in a negative direction. This study 
raises questions about the nature of student teach- 
ing, suggesting that the experiences provided are 
either very realistic and similar to regular teaching 
situations or perhaps unduly stressful. The influ- 
ences of student teaching maybe a critical factor in 
the development of teachers’ attitudes toward chil- 
dren. Students not instudent teaching(150 s t uden ts 
in methods courses), maintained their high positive 
attitudes toward children throughout the semester. 

A study should now be made to identify specific 
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TABLEI 


CHANGES IN ATTITUDE AND ANXIETY 


Highly Anxious Non- Anxious 
Student Teachers Student Teachers 
Percentage who changed 
attitude negatively toward 80% (N-25) 80% (N-25) 
pupils 
anxieties developed during supervised teaching and 4. Eysenck, H.J. The Dynamics of Anxiety and 
to determine factors causing atti tudinal change Hysteria. London: Routledge and Kegan Paul, 
while working with pupils and supervisors. T957 


5. Eysenck, H.J. “ Learning Theory and Moral 
Values in Children," The British Journal of 


Educational Psychology, 30: Part I, February 
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An Experimental Study of the Use of Lectures to Large Groups 
of Students in Teaching the Fundamentals of Speech: 


ADMINISTRATORS IN the area of speech, like 
administrators in other academic areas who are in 
charge of one or more ''required" courses, are 
finding it necessary to search for pedagogical pro- 
cedures by which teacher loads can be decreased 
without lessening educational effectiveness. Unlike 
most other departments, however, speech depart- 
ments ordinarily must concern themselves with 
bringing about increases in student speaking skill as 
well as satisfactory assimilation by the students of 
the subject matter covered by the beginning course. 

Experience as well as instructional theory have 
led to the conclusionthatthe best method of bringing 
aboutimprovementinspeaking skill is that in which 
students are given frequent opportunities to perform 
(or practice) under the critical and analytic guidance 
of an instructor. It was with the aim of discovering 
ateaching method which would provide these frequent 
opportunities for practice, and, at the same time 
reduce teacher load, that the present study was 
undertaken. 


Procedures 


More specifically, the purpose of this study was 
to determine whether the change in student perform- 
ance brought about by the use of two methods of teach- 
ing Fundamentals of Speech was significantly differ- 
ent. These two methods are described below. 

The first method was that which is customarily 
employed at the University of Kansas. By this method, 
each class (approximately 20 students) meets three 
times a week with its instructor. Assignments are 
made in a standard text. Class time is primarily 
devoted to the delivery and criticism of short (46 
minute) student speeches. Presentation of theory by 
the instructor is incidental to the need exhibited by 
Students in each class; an occasional class discus- 
sion and/or shortlectureis customary. These clas- 
Ses hereafter will be referred to as the “control”? 
group. 

The second method of teaching employed one 
"'mass-lecture'' per week by an experienced member 
of the Department of Speech and Drama.1 

The other two weekly class periods for the ex- 
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perimental group were devoted to short student 
speeches followed by analysis and criticism by the 
instructor. Approximately 14 students were enrolled 
in each of these **experimental"' classes. 

Тһе essential differences between the two methods, 
then, were as follows: (1) Control classes contained 
approximately 14 each. (2) Control classes met three 
times a week with ‘‘their own’’ instructor at which 
time they normally delivered speeches which were 
criticized by the instructor with occasional discus- 
sion of theory: experimental classes met twice a 
week with “their own” instructor for practice and 
criticism and once a week for a ‘‘mass-lecture, ”’ 
demonstration or movie. 

It was assumed that enrollment procedures at the 
University of Kansas would result in the enrollment 
of a representative sample of F unda mentals of 
Speech students in each of the two groups. А '*sec- 
tioning” procedure is used at enrollment whereby 
students are arbitrarily assigned to particular 
classes of Fundamentals of Speech on the basis of 
the free time remaining after enrollment in other 
courses. Personsin charge of the **sectioning'' were 
unaware that the experiment was to take place. A 
total of 297 students were involved in this study in 
its two-semester duration. 

The following variables were controlled: (1) The 
same graduate assistants served as instructors for 
both experimental and control groups. (2) АП classes 
in both groups met in the morning on Tuesdays, 
Thursdays, and Saturdays. (3) Control and experi- 
mental classes met in the same or incomparable 
classrooms. 

Students were evaluated by their performance in 
two areas: knowledge of speech principles, and 
speaking skill. A description of the evaluative pro- 
cedures follows. 

A rating scale was devised to be used by all judges 
of speaking skill in the project. The type of scale 
selected was a simple 1-10 point scale employing 
only the criterion ‘‘General Effectiveness. » This 
Scale was similarto that developed by Monroe? and 
has been favorably compared with other rating in- 
struments by Thompson. 3 

Panels of three judges, all teachersof the Fund- 
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amentals of Speech course atthe University of Kan- 
sas, were assigned to judge problem-solving 
speeches deliveredat each class session of the oral 
pre-test (during the third week of the semester) in 
both groups. Inafewcases, whenteachers were not 
available, upperclass students with credit in ad- 
vanced speech courses acted as judges. The mean 
ratings assigned by a panel of three judges on the 
1-10 point speaker rating scale were assigned as 
Student scores on the oral pre-test. 

During the same week in which the oral pre-test 
was conducted, a written examination of speech 
principles devised by Barnes was administered. 4 
This examination consisted of 55 multiple-choice 
andtrue-false questions. It had undergone several 
revisions based on tests of validity and reliability. 
The number wrong was assigned as each student's 
Score on the written pre-test. 

During the fifteenth week of each semester, the 
oral and written tests were repeated. A different 
form of Barnes' written examination was used. 

The data were analyzed by the following method: 


(a) The level of significance was set at 505 


(b) The critical region was selected as - 
two standard errors from the mean. 


(c) Inorder to determine whether statistic- 
ally significant changes were apparent 
within eithergroup, comparisons were 
made between: 


(1) Control oral pre-test and control oral 
post-test; 

(2) Control written pre-test and control 
written post-test; 

(3) Experimental oral pre-test and ex- 
perimental oral post-test: and 

(4) Experimental written pre-test and 
experimental written post-test. 


(d) Inorder todetermine whether a signif i- 
cant difference existed in the changes 
exhibited between the two groups, a 
comparison was made between the con- 
trol oral post-test and the experimental 
oral post-test and between thecontrol 
written post-test and the experimental 
written post-test. 


The same procedures were followed when the ex- 
periments were repeated during the Spring semester 
of 1959. 


Results 


Table I shows the results of this study for the fall 
semester, 1958 - 1959. 

According to the data presented in Table I, both 
groups made statistically significant changes in per- 
formance in both areasof evaluation. That is, both 


groups made significant upward shifts in knowledge 
of speech principles and in speaking skill. However, 
the scores of the two post-tests show that there is 
no significant difference between the two groups at 
the end of the semester either in knowledge of speech 
principles orin speaking skill. As a matter of fact, 
the mean scores of the two groups are almost identi- 
cal for each of the post-tests. 

Table II shows the results of this study for the 
Spring semester, 1959. 

The data presented in Table II verify the results 
obtained in the fall semester experiment. Again, the 
mean post-test scores for both groups in speaking 
Skill and in knowledge of the principles of speech 
were well above the critical region of two positive 
Standard errors from the pre-test means. In other 
words, both groups made significantchanges in 
knowledge of speech principles and in speaking skill 
during the spring semester, 1959. Тһе spring sem- 
ester experiment also verified the fall semester 
finding that no significant difference exists between 
the mean post-test scores of the two groups. 


Conclusion 


According tothe results of this study, there was 
no statistically significant difference in the post-test 
performance of groups subjected to the two methods 
of teaching Fundamentals of Speech described in this 
paper. Both groups, however, made significant pos- 
itive changes in knowledge of speech theory and in 
Speaking skill between the beginning and the end of 
each of two semesters. Measurable results of ће 
two methods compared were almost identical. It 
appears that the **mass-lecture''method of teaching 
the beginning speech course as described in this 
study can be employed without significantly dimin- 
ishing the amount of learning. 
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Bowers (M.A., Kansas, 1959) was formerly à 
research assistant in the Department of Speech 
and Drama at the University of Kansas; he 18 
presently a teaching assistant at the State Uni- 
versity of Iowa. 

1. Inthe present study, the lecturer was prof. 
E.C.Buehler, Director of the Fundamentals of 
Speech program. 

2. А.Н. Мопгое, H.H. Remmers, and Elizabeth 
Venemann-Lyle, ‘‘Measuring the Effectiveness 
of Public Speech in the Beginning Course, "" Bul- 
letin of Purdue University, Studies in Higher Е4- 
ucation, б), No. T. 

3. Wayne М. Thompson, “Ап Experimental Study 
of the Accuracy of Typical Speech Rating Tech- 
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niques, ’? Speech Monographs, 11 (1944), 65-79. Basic Speech Performance Course," Unpub- 
4. Ted J. Barnes, “А Study of the Reliability and lished M.A. Thesis, the University d Kan- 
Validity of a *Quiz-Out' Testing Procedure for the sas, 1957. 
TABLE I 


MEANS, STANDARD DEVIATIONS AND STANDARD ERRORS, 
Y FALL SEMESTER, 1958-1959 


ee ee -------..------.- . 6 
Test N S.D. 5.Е. Mean _ 
Control oral pre 73 1.3 .15 5.4 «4.30 
Exp. oral pre 46 1.4 .21 5.34.42 
Control written pre 69 4.3 52 22  :1.04* 
Exp. written pre 52 5.0 69 22 +1.38* 
Control oral post 73 1.4 .16 6.3 + .32 
Exp. oral post 46 1.4 21 6.42.42 
Control written post 69 4.1 .49 16 +.98* 
52 4.8 .67 17 3134* 


Exp. written post 


жАП scores listed for the written examination are the number wrong. 


TABLE II 


MEANS, STANDARD DEVIATIONS, AND STANDARD ERRORS, 
SPRING SEMESTER, 1959 


НЕ 7 5 —— ааа. 
Test N S.D. S. E- Mean 
Control oral pre 97 1.3 .13 5.04 .26 
Exp. oral pre 75 1.3 .15 5.04 .30 
Control written pre 92 5.3 55 22 +1.10* 
Exp. written рге 75 4.1 4 22 + .94* 
Control oral post 97 1.1 1 6.4+ .22 
Exp. oral post 75 1.2 ‚14 6.4+ .28 
Control written post 92 4.2 .44 16 + .88* 
15 4.4 ‚51 16 +1.02* 


Exp. written post 


жАП scores listed for the written examination are the number wrong. 
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А Comparison of European and American "Educational Influences" 


JAMES DRASGOW 
University of Buffalo 


WE HAVE LONG talked in America about possi- 
ble differences between European and American ed- 
ucation but nobody has done much about it. At the 
recent World's Fair in Brussels, there were sever- 
al formal and informal meetings of international 
groups of educators. Nearly e veryonepresent 
Seemed to feel that his neighbor's educationalgrass 
was greener. 

To provide an empiricalcomparisonof European 
and American systems, data were collected on pu- 
pils who had been exposed to both influences. * Ran- 
dom samples of several hundred children were se- 
lected by drawing folders sequentially from several 
different schools’ alphabetically ar ranged files. 
School grades, academic marks, IQ's, ages, sexes, 
Social development and so on were all tabulated. 

In brief, the experimental design consisted of 
comparing school marks from the ‘‘academic’’ and 
**social'' areas on pupils who hadattended both Eur- 
ореап and American schools. In order to control 
for variables such as length of time under one influ- 
ence, different grading systems, etc., the “пеаг- 
est-to-pure'' groups available were selected. These 
меге first graders who had taken German, French, 
Italian or American kindergartens before en tering 
the first grade. A control group of first graders 
with no kindergarten background was also included. 

After matching groups on sex and age and then 
finding no significant difference between groups on 
IQ, the distribution of school marks was plotted 
(see Table below). 

These data indicate that there are nodifferences 


BRUCE C. STOCKIN 
V. A. Hospital, Columbia, S.C. 


beyond chance (chi-square P =.10) in either the 
academic or in the social marks which pupils re- 
сеіуе in first grade after having completed a previ- 
ous year in either a European or an American 
School. If the school marks are an acceptable cri- 
terion, this also means thatitdoes not matter whe- 
ther a child goes to an American or to a European 
School first. 

Comparison of the control group's distribution of 
marks with the other two groups reveals larger 
within-group variance or heterogeneity in the no- 
kindergarten group. If **homogenization" is desir- 
able and possibly related to ‘‘s oc ialization’’, then 
kindergarten apparently does it. The dataalsoshow 
that ten or more times as many non-kindergarten 
children receive lower social marks than do child- 
ren who have had a kindergarten experience. While 
kindergarten apparently helps inthe achievement of 
“average” or better social marks, it seemingly 
doesn’t matter whether it’s an American or Euro- 
pean one. 


Conclusions and Considerations 
L9one*usions and Considerations 


The difference between European and American 
education may not be as great as often suggested. 
It could be, of course, that comparisons madeat 
more advanced grade levels might reveal effects of 
educational influences which aren't apparent in 
first-graders. But at the present time, with the a- 
vailable data, it looks as if the matter of education- 
ally greener grass may be an illusion. 


DISTRIBUTION OF FIRST-GRADE MARKS FROMCHILDREN WITH 
EUROPEAN, AMERICAN AND NO-KINDERGARTEN BACKGROUNDS 


Academic Marks* 


High ^ Average 


European 2% 94% 
Атегісап 4% 91% 
None 12% 56% 


“Ауегаве” marks were statistical 
standard deviation of the mean. ‘ 


scores beyond one sigma. 


Е oio EU Pe LS 
* These data were collected in Europe (France, 
tesy and cooperation of the U.S, A. Dependents' 
the Elementary Education Specialists and the El 


Social Marks 
High ^ Average Low 
0% 98% 2% 
3% 94% 3% 
3% 64% 33% 


ly defined as all scores within one 
‘High”’ and “low” marks were all 


Germany and Italy) and were made available through the cour- 


Education Group in Europe's Elementary School Coordinator, 
ementary School Guidance Counselors. 
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PHILOSOPHY OF OUTDOOR EDUCATION, William 
H. Freeberg and Loren E. Taylor. Minneap- 
olis: Burgess Publishing Company, 1961. 
x + 445 рр. 


The message of the book is this: Direct sen- 
sory experience is good education and since outdoor 
education makes principal use of direct sensory ex- 
perience it is good education. 

This is a useful volume in several respects. 
The authors, both on the faculty of Southern Illinois 
University, have focused their book оп “©... {һе mean- 
ing and significance of outdoor education in the 
school curriculum."' They say: 


Outdoor education is a method of education 
and one that can be successfully and inte li- 
gently introduced by all teachers, in all sub- 
ject-matter areas, to supplement and comple- 
ment written and oral expression. 


They state this without limitation, encouraging the 
inference that this is atoolas applicable to the study 
of home economics as to natural history. 

A stronger case could be made for outdoor ed- 
ucation if it were looked upon as an approach to the 
study of natural science which uses the natural en- 
vironment as both the place and the m aterials of 
learning, Then the values cited would have specific 
meaning: the integrative value that comes from ob- 
serving a biological or physical phenomenon in its 
natural setting; the development of powers of obser- 
vation under guidance and encouragement; the learn- 
ing and the physical fitness values that arise from 
being active in the fresh air; the spiritual values 
fostered by a moment of meditation under the spell 
of the outdoors. 

There are good chapters that reveal in broad 
strokes the sweep of social change; the nature and 
functions of education per se; the historicaland phil- 
osophical foundations of education; the history of 
camping and the impact of the new leisure. The sub- 
stance of these is generally well done but there is no 
attempt to draw outof these rich resources the par- 
ticular meaning they holdfor o utdoor education. 
Outdoor education has vast possibilities for awaken- 


Book Reviews 


ing interests and, thus, for fostering learning, as 
pointed out by philosophers and educators from Plato 
to Pestallozzi and Rousseau to Dewey and beyond. 

There are too many adjectives for effective 
writing (e. g., ‘‘extremely static"); there are occa- 
sional inaccuracies (е. g., “Forestry Service"); 
misspellings (e. g., ‘‘Hypocratic oath’’), and redun- 
dancies (e.g., leisure time). InChapter V, the au- 
thors take pains to distinguish ‘outdoor education’ 
and ‘camping’ but discuss only the camp as the set- 
ting for outdoor education throughout. The history 
of camping is detailed in Chapter V and again more 
briefly in Chapter XI; the contemporary impact of 
leisure is discussed in Chapter I and again in X. 
Nevertheless the authors have obviously studied with 
diligence much of the substance from which the vol- 
ume grows and they have presented a good part of 
that substance with discrimination. 

There is an unanswered question of whether 
outdoor education has sufficient distinctive substance 
to be offered as aseparate course experience to 
prospective teachers or whether it should be used 
and taught as a method or approach to the learning 
of natural science, the field approach, so to speak, 
alongside or in place of the laboratory approach to 
learning about the world in which we live. 

H. C. Hutchins 
University of Wisconsin 


LET’S READ, Leonard Bloomfield and Clarence L. 
Barnhart. Detroit: Wayne University Press, 
1961. 465 pp. 


Reading instruction inthe public schools comes 
under periodic attack from professional educators 
and from interested citizens outside the profession. 
This is in part the result of the pre-eminent place 
reading occupies in the education of American chil- 
dren. “Some subjects are more important than 
others. Reading is the most important of all," 
writes John W. Gardner, 1% 

Overwhelmingly, the teaching of reading in 
elementary schools is organized to utilize basal 


peto AM ci eer puma es C HERES TS 
*Footnotes will be found at the end of the review. 


388 THE JOURNAL OF EDUCATIONAL RESEARCH 


reading textbooks. Two kinds of charges, not nec- 
essarily related to each other, have been leveled at 
basal reading instruction: the first charge argues 
that most reading texts are based on an jnadequate 
understanding of the mechanics of language and of 
reading, causing initial reading instruction to be in- 
appropriate and productive of a large percentage 
(rarely specified) of failures; the second charge 
challenges the: content of the readers on the grounds 
that controlling reading vocabulary has produced 
vacuous selections which are inadequate to the task 
of introducing children to our literary heritage. 

Leonard Bloomfield’s essay, '* Teaching Chil- 
dren to Read” finds reading instruction wanting in 
the way in which it confronts children with the task 
of learning to read; itgoes on to present an alterna- 
tive based on a linguistic, essentially a phonemic, 
analysis of English. Although parts of the late Pro- 
fessor Bloomfield’s essay have appeared earlier, 2 
the manuscript, originally completed in 1937, ap- 
pears in its entirety for the first timein Let's Read. 
The twenty-three page essay is the heart of this book 
and the reason for its existence, The bulk of the 
remaining 442 pages of the book consists of the ac- 
tual reading lessons for children. To Bloomfield’s 
original 2500 words of initial reading instruction, 
Barnhart has added approximately another 2500 
words, mostly in the advanced sections and mostly 
words of two syllables, which Bloo mfield deliber- 
ately avoided. Barnhart hasalso divided the lessons 
into somewhat shorter units of instruction. 

Bloomfield believes that any reader performs 
two separate tasks: letters inwordsrepresent sounds 
to a reader; getting the sounds from the page, the 
reader performs the secondtask of trying to under- 
stand what these sounds mean. It is only the first of 
these, transforming written symbols into sounds, 
which Bloomfield accepts as defining reading.3 Mak- 
ing sense out of sounds is a task common to respond- 
ing adequately to any use of language. This kind of 
restriction on the domain of the reading teacher is 
consistent withthelinguist's usual restriction of his 
own domain, language. 

Defining language as ‘‘an arbitrary system of 
articulated sounds made use of by a group of humans 
as a means of carrying on the affairs of their soci- 
ety, 4 linguists have discovered that of the vast 
range of sounds possibleto any human speaker, each 
culture selects a small number of distinct and sepa- 
rate sounds out of which to build a language. Each 
member of this finite set of irreducible speech sounds 
is called a phoneme. (English has forty-six phon- 
emes, according to Н. А. Gleason.)9 

Alphabetic writing uses written symbols to 

represent speech sounds. English, although it is 
* extremely imperfect and arbitrary'' (Bloomfield’s 
description), is analphabetic system. Since an Eng- 
lish writer transforms Speech sounds into written 
symbols, a reader of English has the task of using 
the written symbols to help him recreate the original 
Speech sounds. 


* 


Since Bloomfield’s analysis stresses the sub- 
stitution of emphasis on word-form for preoccupa- 
tion with word-meaning, and since the relationship 
of letter(s)-to-sound is central, some readers may 
be tempted to equate this approach with phonic ap- 
proaches to teaching reading. Bloomfield dispells 
any such notion explosively. 


The letters of the alphabet are signs 
which direct us to produce sounds of our 
language. A confused and vague appre- 
ciation of this fact has given rise to the 
so-called **phonic' method of teaching 
children to read. ..these methods con- 
fuse writingwith speech. They plan the 
work as if the child were being taught. . . 
tospeak. Asa matter of fact, nearly 
all six-year-old children have long ago 
learned to speak their native language; 
they have no need whatever of the drill 
which is given by phonic methods. 6 


Phonics errs not only in confusing writing 
with speech, but also in isolating speech sounds. It 
expects the childto produce oncue the sounds of let- 
ters isolated from any word in which they appear. 
Children practice making the *t- sound," for ex- 
ample, either alone or followed by an obscure vowel 
sound. 


English speaking pe ople...are not ac- 
customed to making that kind of noise. 
Learning to pronounce such things is 
something in the nature of a stunt, and 
has nothing to do with learning to read. 
We intendto apply phonetics to our read- 
ing instruction; this does not mean that 
we are going to try to teach phonetics to 
young children. In this absurdity lies 
the greatest fault of the so-called phonic 
methods. 7 


It is important to remember that Bloomfield’s 
description is of practices which he believed were 
common in the mid-1930's.9 The editors make no 
suggestion that anything much has changed since then. 
The flat assertion that ‘actual instruction in our 
Schools consists almost entirely of. . the word meth- 
od,’’ symbol-to-sound relationships saves him from 
the burden of learning separately all of the words he 
must know. If this purported to describe current 
reading instructions, it would be tempting to label it 
absurd. Even a cursory look at the teachers’ man- 
uals of eurrent reading texts reveals that the first 
formal teaching of letter-to-sound relationships 0C- 
curs typically during the first grade. Attention 18 
directed even earlier to aural discrimination of 
sound contrasts, especially in initial consonants. 
Unfortunately we have little assurance that the read- 
ers are being used as their authors intend them to 
be used, which underscores the difficulty of speak- 
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ing with confidence about ‘what the schools are do- 
ing." The discussion in Let's Read exercises little 
caution in this respect. 

if a word method is one which tells children 
what word each group of letters represents, rather 
than forcing them to *«work it out’? for themselves, 
then Bloomfield's is its elf a type of word method. 
(See page 41, *...teacher says, *Now we have 
spelled the word. Now weare going to read it. This 
word is can. Read it: can.’’’) It is а word method 
with this ditference: the words are presented in such 
away that the co ntrasts in sound may be easily 
identified with differences in the letters used to form 
the contrasting words. Particularly in the earliest 
stages of instruction, vocabulary is controlled with 
great care so that each letter represents only one 
sound and each sound is represented by only one let- 
ter. (Strangely enough Bloomfieldfinds it necessary 
to violate this axiom on the very first page of his 
reader where, in such phrases as «а can, a fan, а 
pan", a has boththe regular sound as in “сап” and, 
when itis a word by itself, the irregular sound as 
in “‘bake’’. Itis difficult to understand why this pro- 
nunciation was chosen since, as Bloomfield points 
out, this is rarely the pronunciation of the word “а” 
іп normal speech.) The proc edure for controlling 
the introduction of words simply extends to reading 
instruction the linguist’s method of compiling lists 
of words with minimal differences in order to dis- 
cover those speech dounds which serve as phonemes 
inalanguage. Bloomfield’s thesis is that such a 
presentation, by stressing the correspondence of 
phonemes and graphemes (written or printed sym- 
bols), builds the ability to read upon what is basic 
tothe expressive structure of a language, and in- 
sures children a greater measure of success achieved 
more quickly and easily, under conditions which al- 
low them to become independent readers more read- 
ily. 

Let’s Read suggests 2 num per of intriguing 
hypotheses with respect to teaching reading and es- 
pecially with respect tothe beginning stages of read- 
ing instruction. One group of hypotheses would aim 
at making clear the nature of the relationship between 
reading and correspondences between phonemes and 
their spellings. A group at Cornell is currently in- 


vestigating these relationships as they exist in adult 
readers. 9 Davis and Diman!0 have conducted pre- 
liminary attempts to use Bloomfield-like materials 
with beginning readers in public school classrooms. 

A second set of hypotheses includes those 
aimed at discovering thefunction and causes of mo- 
tivation in learning to read. Some critics of the 
basal reading texts have argued that controlled vo- 
cabularies should not play as importanta role as they 
do in determining content. Bloomfield attacks the 
lack of control (with respect to phoneme-grapheme 
correspondence) which characterizes the presenta- 
tion of words to beginning re адегв. Does the even 
stricter control of vocabulary, needed to avoid am- 
biguity in the 8 pelling of many English phonemes, 
increase initial reading success? Or do the limits 
on content imposed by this control produce material 
so empty of meaning as to make reading impossibly 
tedious? 

Bloomfield predicts that the motivating facet 
of content is very likely irre levant and possibly 
harmful to a beginning reader. The child beginning 
to read is too busy changing the printing into sounds 
to have energy or interest left for listening to а mes- 
sage. Knowing that he has responded to symbols 
with appropriate sounds is motivating enough to keep 
a reader working, even if the product is a nonsense 
syllable, according to Bloomfield. 

Highly meaningful material may be reinforc- 
ing in ways that non-meaningful material is not. It 
may provide a reader with the opportunity independ- 
ently to estimate the correctness of a response. 
This opportunity is denied him to the extent that the 
content of his reading is unpredictable . Whether 
the content of basal reading texts is significantly 
more meaningful, in this respect, than the Bloom- 
field materials is open to question. 

Bloomfield’s ideas and the Bloomfield- Barn- 
hart materials will make a distinct contribution to 
the teaching of reading if they stimulate experiment- 
al activity directed toward verifying hypotheses such 
as these. The end result can only be that we learn 
more about the nature of reading and extend our 
ability to teach children to read successfully. 

B. Robert Tabachnick 
University of Wisconsin 
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the report of the 


Goals for Amer: o : 
President’s Commission on National Goals 


(New York: Prentice-Hall, 1961), p. 8. , 
2. Leonard Bloomfield, “Linguistics and Reading," 
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Synthesizes 75 doctoral studies to offer 
practical insights for practitioners, and 
fresh theories for further research: 


e What are the criteria of teacher effectiveness? 


e What are the pertinent. theories of the 
nature and structure of human abilities? 


e Have statistical techniques been adequate? 


ө Are there discrepancies between teacher 
potenial and performance? 


e What are the personal and professional 
prerequisites to teacher effectiveness? 


e Can good teachers be distinguished from 
poor teachers? 
* 


This milestone monograph presents a 
critical overview of 30 years of important 
research into teacher effectiveness, 

and offers such general observations 

and new hypotheses as appear to 

be supported by the data. 
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Readers 
Write 


(The JER) is perhaps, by all counts the 
one journal which appeals to a large 
number of readers. I suspect that it has 
more subscribers than any half dozen 
comparable magazines combined. . . . 
For myself, I should like to see The JER 
publish from time to time articles of a 
semi-philosophical nature. For example, 
some one might wish to prepare a paper 
indicating just what Theories of Learn- 
ing mean for the classroom teacher.— 
ROBERT A. DAVIS, Professor of Edu- 
cational Research, George Peabody Col- 
lege for Teachers, Nashville, Tenn. қ 


What I like most about Тһе ТЕҢ is that 
it is written in English, because English 
is the only foreign language I can under- 
stand. What I like least about The JER 
is that it is expensive for foreign teach- 
er.—TAKEO YASUDA, science teacher, 
Junior High School attached to Tokyo 
University of Education. 


The new format and content are excel- 
lent. The JER represents a fine balance. 
Just a personal bias: I would like to see 
more research reports at the elementary 
school level. -ROBERT L. HILLERICH, 
pan, School Principal, Glenview, 


Your "It Says in The JER" section moti- 
vates the reader to seek out the detailed 
research report presented elsewhere in 
The JER. Research reports too frequently 
are several years old.—M. P. CAPPS, Di- 
rector of Testing, Virginia State College, 
Norfolk, Va. 


I like your content and arrangement; I 
dislike the small print. —FRANK H. 
GURLEY, Director of Research, Infor- 
mation, and Development, Contra Costa 
County Schools, Pleasant Hill, Calif. 


Ilike The JERS concise presentation of 
profiles and reports of research—J. Т. 
WILLIAMS, Principal, Carner High 
School, Gadsden, Ala. 


Some of your research reports are for а 
very limited audience.-HARRY F. 
SCHLICHTING, associate professor of 
education, Ohio Wesleyan University. 


Much progress has been made.—W. H. 
DUTTON, associate professor of educa- 
tion, UCLA. 


We need more critiques of existing re- 
Search, and critical analyses by. special- 
ists. EDWARD J. KELLY, i " 
educational division, Colorado State Col- 
lege, 


I enjoy “It Says in The JER,” some re- 
search reports, and “Field News."—M. 
DALE BAUGHMAN, associate professor 
of educational administration, University 
of Illinois. 


JER research reports are varied and have 
Pertinent professional material, but some 
are too heavily laden with  statistics— 
JOHN A. POWERS, administrative as- 
sistant, Washington Junior High School, 
Racine, Wis. 


CONTINUED SERVICE 


As some of you are aware, The Journal of Educational Research 
has recently conducted a sampling of reader reactions in the interests 
of continued improvement and service. 


One interesting result of this survey has been to reveal just who it 
is that makes up the growing roster of JER subscribers. Probably no 
other journal in the field of education has so catholic an audience. 
There is the seasoned university scholar in New York, the eager 
school teacher in California; the librarian in Montana, the experi- 
ment-minded curriculum supervisor in Mississippi; the progressive 
superintendent in Texas, the consultant in Arabia; the college profes- 
sor in Maryland, the busy provost in Canada. 


To meet the varied needs of this international cross-section of edu- 
cation is the challenging role of this journal. The JER seeks to be 
both a pace-setter and a common denominator. In a real sense The 
JER is the publication which communicates to the universe of educa- 
tion. It seems to help bridge the gap between investigation and en- 
lightened applicaggn, linking the professional researcher, the admin- 
istrator, and the classroom practitioner. 


The unique role of The JER was once delineated by its editor in 
these words: 


The Journal of Education Research is a research publication for field 
workers. If one wishes that teachers take a student attitude toward 
teaching, then The JER is a goldmine of ideas about how other people 
study the problems of education. Yes, we provide findings; but more 
important, we illustrate the ways that alert teachers study their problems. 


The results of our reader poll constitute a testimonial to the work 
of The JER in providing broad coverage in significant depth. We are 

articularly gratified at the vote of confidence for our new monthly 
ene. "It says in The JER." We are pleased so many of you recog- 
nize that over the years the "big names" in education appear among 
our contributors, presenting substantial evidence on major roblems 
of education. Your replies indicate The JER is widely read, widely 
quoted, and widely used. 


But our survey has also revealed reader awareness of a problem 
that has plagued us—the lag between the time a manuscript is 
accepted and the date it is published. So extensive is the research 
now being conducted in education that The JER's backlog of quality 


manuscripts is mounting steadily. 


Fortunately we are in a position to do something about it. Thanks 
to your response to our current development program, the list of 
JER subscribers has doubled in the past 15 months. This aye us the 
resources with which to add extra pages to each issue and to add an 


extra issue a year. 


So, beginning with the next volume, you will receive ten regular 
issues of The JER annually, instead of nine. Each issue also will be 
“fatter.” Our long-range plans call for additional expansion in 1963-64. 


We of The JER thank you for your interest and support. We en- 
courage your continued submission of outstanding research reports 
or articles. In turn, we ledge our continued dedication to the ad- 
vancement of the scientific study of education. 


It Says in The JER... 


HOW DOES THE USE OF A TEACHING MACHINE COMPARE WITH, LECTURE- 
DISCUSSION? In a limited study at Temple University, programmed instruction stacked up very 
well, Hough reports: (1) teaching machine instruction was found to be equally as effective as 
the lecture-discussion method in terms of end-of-course tests, and (2) TM instruction was found 
to be considerably more efficient in terms of instructional time saved. In the same study Hough 
could find no significant differences among three response modes—constructed, selected, or a 
combination of the two. 


WALKER AND STOLUROW stick pins in some currently accepted notions about teaching 
machines and programmed learning in the report of a study examining the comparative effective- 
ness of overt and covert responses. One group of students, working through a standard pro- 
grammed learning text, responded “overtly” in writing; another group responded covertly by 
merely "thinking" the answers. No reliable differences in learning or retention measures were 
found, and the covert-response group actually required less time than the overt-response group. 
While these data suggest that learning can be more efficient with a covert-response method, the 


researchers recommend further studies that will explore the role of error rate in the whole problem. , 


COMPUTER-BASED TEACHING MACHINES “will adapt auto-instructional programs to indi- 
vidual students’ needs and abilities in the not too distant future,” Bushnell predicts in a survey 
of the application of the digital computer to educational functions. He describes the situation as 
holding the potential of an important technological breakthrough.” 


PROGRAMMED LEARNING HAS A GREAT FUTURE in the schools, Lambert says, provided 
both more sophisticated and more practical research is performed, and provided teachers are 
helped to see the teaching machine as an aid and not a threat. 


WHICH IS MORE EFFECTIVE, TEACHING MACHINE OR PROGRAMMED TEXT? Feld- 
husen and Birt experimenting with nine groups of 30 college students each, found no significant 
difference in amount learned from a form of linear program when presented with a mechanica 
teaching machine, with a simple masking device, or with no device at all. Nor did any differences 
show up between individual and group pacing. However, the researchers indicate the sample o 
programmed material used in their study was very short, and suggest that "in a longer learning 
session the differences among groups might have been greater, and a clearer picture of relation- 
ships among the methods might have been afforded." 


WHAT ARE THE ATTRIBUTES OF A PARTICULAR PROGRAMMED TEXT? After work- 
ing in the Denver public schools with one of the first programmed texts оп English grammar, 
Reed and Hayman report: (1) in general, student groups using the programmed text scored no 
higher on tests than control groups covering the same material in traditional fashion, (2) Eng- 
lish 2600 was somewhat more effective with students who have high achievement than with low- 
achievement pupils, and (3) more able pupils worked through English 2600 more rapidly than 
did average ability students, who in tum worked much more rapidly than did low achievers. 
The researchers also detected that when pupils know instinctively that their responses are correct, 
but do not get reinforcement from the printed answer, they are confused and many are angry. 


HOW DO YOU CONSTRUCT А PROGRAM? Evans, Glaser, and Homme suggest a “Ruleg Sys- 
tem," in which material to be covered is broken down into “rules” and “examples” in terms of the 
desired outcome, carefully ordered, assembled into frames, tested, and revised. They offer the sys- 
tem as “а helpful prompt." 


> 


AFTER DEVELOPING AS A RESEARCH TOOL a standard learning program that lends 
itself to experimental variation, Evans, Glaser, and Homme draw these conclusions from tests con- 
ducted with groups of college students: (1) variations in mode of responding affect learning time, 
but final test scores are not so affected; (2) systematically constructed programs are more efficient 
than subjectively-designed programs; (3) differential retention effects are a function of the type 
of performance measured. The authors go on to suggest we have only scratched the surface in the 
n of programmed techniques for the investigation of problem-solving and concept-formation 
ehavior. 


ARE THERE BOUNDS OUTSIDE OF WHICH THE RELATIVE EFFECTIVENESS OF 
PROGRAMMED INSTRUCTION IS NOT DEMONSTRABLE? Definitely yes, say McDonald 
and Allen after attempting to test the comparative effectiveness of five programming arrange- 
ments, using a short program and a small sample. Between their various forms of programmed 
material, and in comparison with no program at all, they could find no significant differences in 
student achievement. Their conclusions are: (1) within some as yet undefined length limit, pro- 
grammed content does not yield superior learning, and (2) the effects of programmed instruction 
will be demonstrated for units of content which exceed by progressively larger amounts this as yet 
undefined upper limit for “short” units. This experiment focusses attention on the question, 
pest eu what conditions does programmed instruction yield a marked improvement in 
learning? 


INVESTIGATING A BASIC FORM OF PROGRAMMING, known as “discriminative transfer 
learning using literal-prompt withdrawal,” Taber and Glaser have been able to teach youngsters 
how to sight-read the names of various colors. They suggest their study may have a practica 
application to other subject matters as well as being of theoretical value in fundamental research 


into learning processes. 


DOES IT MAKE A DIFFERENCE HOW THE CONFIRMING RESPONSE IS PRESENTED? 
Krumboltz and Bonawitz tested the relative effectiveness of an “isolated word" answer versus the 
same answer presented “іп context" with a restatement of appropriate material. While their “con- 
text” groups did not exceed the "isolation" groups in knowledge of terminology on a final test, 
they did excel significantly in ability to apply principles. If confirmed by further research, their 
study suggests that "the more frequently à desired response occurs in contiguity with relevant 


stimuli, the greater the probability of effective learning. 


IS BRANCHING MORE EFFECTIVE THAN A LINEAR PROGRAM? Not according to an 
exploratory study by Roe. Investigating certain. methods of presenting programmed material that 
might speed up the learning process without adversely affecting terminal performance, Roe came 
to the following conclusions: (1) there was no significant difference in either learning time or test 
scores between the branching and the linear methods, and (2) there was a significant difference 
in learning time (but not in test scores) favoring the forward over the backward branching pro- 
cedure. He also found that item sequence seems to have a significant effect on student erform- 


ance. Because the branching procedures he tested were relatively simple, Roe suggests his nd- 


ings “ ; i igation i i articularly 
ings “should not discourage investigation into more complex branching procedures, particu'a 
those which allow Seodifications to the branching rules during the course of student learning. 


jj hat pro- 
THE UNITED STATES AIR FORCE ACADEMY has been “unable to demonstrate that 
grammed шз is manifestly superior to conventional classroom methods for the teaching 0 
elementary statistics," Lt. Col. Smith reports, but his study does indicate that the time required 
to achieve learning can be reduced through programmed instruction. His cadets responded very 
favorably to the experimental method. 


t how to use the University Library by 
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DOES STUDENT ATTITUDE AFFECT PROGRAMMED LEARNING? Della—Piana believes jt 
does. In an exploratory study, student “motivation,” defined as student evaluation of the method, 
seemed to be positively related to performance on a constructed-response program. He calls for 
more studies “on how motivation operates to influence performance,’ and suggests the practical 
advantage of taking preferences into account in assigning programmed texts to students. In his 
study, Della—Piana could find no significant differences among various response modes in terms of 
either immediate or delayed retention, but the more highly prompted presentations did yield an 
advantage in time-saving. 


WHAT IS THE FUTURE OF PROGRAMMED INSTRUCTION? Stolurow makes these pre- 
dictions: (1) Auto-instructional methods and devices are here to stay, teachers will come to gras 
their potentialities, and the comparative study of live versus automated teaching will cease, (2 
future research will concern itself with discovering the important characteristics of materials and 
methods, and this will lead to a theory of teaching, (3) basic changes in our thinking about what 
a course is will take place, and courses will be revised as a result of new insights, (4) research will 
demonstrate the importance of sequence factors, (5) the devices of the future will be either books 
or computer-based machines, small devices will drop out, (6) there will emerge a new form of 
dynamic individualized programming that will be capable of aiding persons to solve problems. 


IN ONE OF THE FIRST EXPERIMENTS ON PROGRAMMED LEARNING WITHIN A 
SCHOOL SETTING, Lambert, Miller, and Wiley found that the most significant factor associ- 
ated with immediate acquisition of the programmed material was intelligence. Furthermore, they 
found that the covert response method took significantly less time for completion without a corre- 
sponding decrease on the immediate-retention test. The researchers feel that the interpretation of 
such results should be made on the basis of the power of the tests of significance, and the results 
of such an analysis used to judge the benefit and direction of further experimentation. 


EIGEN, FILEP, GOLDSTEIN, AND ANGALET, in a study comparing machine, horizontal-text, 
and vertical-text modes of presentation, confirm other findings in this issue that inter-mode differ- 
ences in learning are not statistically significant, although eighth-grade pupils programmed texts 
completed their work in somewhat less time than did*pupils using machines. The researchers 
suggest that the interaction between IQ and presentation mode be studied more closely. 


Ji 


Programmed Texts in Vermont 


A major effort is being made at Goddard College 
to introduce programed texts into the schools of Ver- 
mont. About one-half of the superintendents, prin- 
cipals, and mathematics teachers of the Vermont 

- schools were consulted concerning their interest іп 
introducing mathematics programed materials at this 
time; all were definitely interested. It is believed the 
programed materials would be of particular benefit 
to small schools for the following reasons: 


1. Great diversity of ability and achievement 
within a classroom group. The most obvious 
example of this is the one-room school which 
"may contain from two to eight grades taught 
by a single teacher. Less obvious Lem more im- 
portant is the fact that the limited enrollment 
and curriculum of a small school results in stu- 
dents representing almost a complete range of 
ability being taught the same subject in a single 
classroom—in larger schools, а wider váriety of 
courses are offered so students of different abil- 
ity levels can take different courses, and ability 
grouping can be used. because there are several 
sections of the same course, With programed 
materials the teacher should be better able to 
handle several ability groups within a single 
class. 


2. The limited curriculum. The most obvious ex- 
ample is the transfer student who is unable to 
take a course when he should because it is 
taught only on alternate years. Less obvious but 
more important is the fact that students cannot 
take courses for which the enrollment is low— 
this pertains not only to somewhat special 
courses but also to standard courses, such as 
physics and twelfth-year mathematics which are 
omitted not only on alternate years but also on 
the years when the sampling nature of the popu- 
lation of small schools results in only a few 
interested students. ) 

Dr. Robert Schweiker, Professor, Educational Re- 
search, of Goddard College, Plainfield, Vermont, 
describing mathe- 
in order that 


ing Vermont schools. If any one is able to send 
example copies of mathematics programs, these mate- 
rials would be shown to the superintendents, prin- 
cipals, and teachers and the materials would be le 
in schools where they will be tried out. 


Field News 


Delinquency Research 


Тһе Jewish Vocational Service of Milwaukee has 
been awarded a grant of $400,000 by the National 
Institute of Mental Health to conduct a five-year 
research and demonstration project in the fiel of 
proa aget This project represents a coordinated 
effort by the Milwaukee Public Schools, the Probation 
Department of the Children's Court, and the Jewish 
Vocational Service to demonstrate a program which 
is expected to lessen the impact of delinquency on 
this community. 

The aim of the program is to demonstrate that half- 
time work experience in combination with half-time 
school attendance will reduce the anti-social behavior 
of adolescents who have displayed delinquent be- 
havior in the past. Other goals of the program include 
lessening the drop-out rate from the schools and 
increasing participation of these youths in the aca- 
demic work. 

About 190 fourteen and fifteen year old youths who 
have had at least two police contacts were selecte 
by the Milwaukee Public Schools for referral to this 

roject. This will include some youngsters returnin| 
bou institutional settings. Half of these referrals will 
be assigned to an experimental group and the other 
half to a control group. The control group will remain 
in school full time and will receive services that are 
presently available from the schools and other com- 
munity agencies. The experimental group will spend 
half of each day in school and the other half at the 
Jewish Vocational Service where they will participate 
in a controlled work experience program. The work 
they will do will be obtained through sub-contract 
from local industries and will be supervised by a 
workshop staff trained and experienced in working 
with teenage delinquents. 

Psychological сше will be available to the 
adolescents and work wi also be done with their 
families. If other communi agencies are working or 
have worked with a particular family, an attempt will 
be made to work together with the other agencies. 
Youths also will be offered services in the area of 
recreation and recreational planning. 

Individuals assigned to the experimental group 
will be able to participate in this program for three 
years. An attempt will be made to assist the youths 
in their vocational planning, and when it is felt the 
individual is placeable in competitive employment, he 
will be assisted in finding a job. Efforts will be made 
to encourage individuals to obtain further schooling, 
either academic or vocational. 


Reading Interest 


A. reading interest inventory was given to about 
6,000 sixth, seventh, and eighth grade students in 
1961 in the public schools of Rockdale, Texas. Some 
of the findings reported by Mr. J. M. Moorman, who 
conducted this study, are as follows: 


No. No. No. 
Students Students Students 
2179 1860 + 
6th 7th 8th 
grade grade grade 
% % % 


A. What do you read at home? 
Бооке E 
Magazines 
Comics ^»; ew Th 
Bible. КОШЫ 


Other аа ЗЭ, BO) X on 32 


B. Approximately how many 
hours do you read per 
week outside of school? 


Less 1 hour ...13.6....... 14055 See 119 
Phot es ЖҮ т" Cre Jom es 14.0 
2 hours....... Id ous JET rd. 17.3 
8-4 hours ...,..21.5....... TS ricis 25.6 
5-7 hours ..... 1708: 556 ШАШ 19.3 
Over 7 һоџштѕ.. .14.6....... 105.2 119 
C. What subject do you like most? 
Science ....... P^ t: Mp given DD A 25.7 
Маа 96:0 ptas PBB oes 28.4 
English ....... 1985 іл 20.057 50904 22.5 
Soc, Science ...20.9....... Уз cale 22.8 


New Research Bureau 


Colorado State College at Greeley has established 
a Bureau of Research Services which is designed to 
perform the following services and activities: 

1. A research design апа planning service for grad- 

uate students, faculty members, and public 
schools. 


2. A research reporting service, including a semi- 
annual research monograph reporting research 
done by students, асу members, and college 
divisions and organizations. 


3. Stimulation and initiation of research within the 
college and among public schools in the state. 


The co-directors of the Bureau are Jack N. Sparks 
and Herbert H. Hughes. 


College Drop-outs 


In a study of the college drop-out, conducted by 
two University of Utah officials and reported in Col- 
lege and University, Winter Issue, it was found that 
the poor college students who drops out of school and 
then re-enters after using his drop-out period in a 
meaningful way, such as military service, church 
work, or satisfactory work at another college, returns 
as a much-improved student and tends to do much 
better scholastic work, 


UCEA Proceedings 


The University Council for Educational Adminis- 
tration, Incorporated, at the meeting of the Board of 
Trustees in February at Atlantic City decided to pub- 
lish papers presented at the UCEA Research Semi- 
nars. $1600 will be allocated for the publication and 
purchase of four hundred copies of the book which 
will be distributed to each participating member 
institution. 


John Schmid, College of Education 
University of Arkansas, Fayetteville, Arkansas 
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FOREWORD 


. THE TEACHER AND THE MACHINE 


We all know the instant prejudice that one meets when one speaks to a practicing teacher — and 
sometimes to administrators — , about teaching machines. Of course, we don't call them teaching 
machines — we speak of ‘programmed learning." But the teachers aren’t fooled. They ‘‘know”’ 
that what we are talking about is a teaching machine, and what they immediately visualize is a life- 
size machine, a robot teacher: something that is going to roll into their classroom on its metal 
wheels and stretch out its metal arms and take their children and their jobs away from them. Some- 
times they become 50 concerned about this possibility that they will ask: Why do we need machines? 


What can programmed instruction do that I can't do? 


]t is becoming easier to answer these questions. We need machines, first of all, because we do 
not have enough quality teachers to keep pace with our growing populatioa. Already schools all over 
the nation are unable to find enough teachers for their pupils, and there is no indication that this sit- 


uation will improve in the future. 


re serious problem. The amount of knowledge that must be assimilated 
they are able to earn à livelihood is increasing on an exponential trend 
the total accumulated knowledge of the human race. How 


can our present educational system cope with this situation? Must our teachers spend more and 
more of their lives in school, or have they already reached the point where they must both study 
and work during their entire lives if they are to keep up with the advance of knowledge in their own 


fields? 


But there is an even mo 


by our young people before 
curve along with the curve representing 


A related problem is the growing recognition of individual differences, individual abilities, and 
individual needs in the student. Almost every educational journal today carries articles recom- 


mending that school curriculums be individualized. 


A breakthrough in education is clearly necessary — and programmed learning is this break- 
through. With programmed instruction schools can take both the teacher shortage and the rapid ad- 
vance of knowledge in their stride. What is more, programmed learning provides a unique oppor- 
tunity to give each student а curriculum which is paced to him. 67 


ammed learning do that the human teacher can't do? In one sense the answer is, 

*«Nothing.'" Because these programs are written by human teachers, there is nothing they can do 

that an individual teacher, Or rather tutor, devoting himself hour after hour and day after day to 

one individual child, could not do. Essentially, the teaching machine is а private tutor working un- 
m teacher. 


der the supervision of the classroo 


«Programmed learning might solve these 
The idea of teaching machines seems 50 с 
and my work mechanized. ” 


What can progr 


The teacher may Say to us, problems and do these 
things, but I still don’t like it. old, so rigid and imper- 
sonal. I don’t want my classroom 
Again the teacher is thinking in terms of а stereotype, in terms of the factory or the assembly 
line. The goal of programmed instruction is not to turn out identical products or identical minds. 
The average program is no more impersonal than а textbook — often it is a textbook. In fact, it 


is more personal, because it is self-paced. 


Misunderstanding of what programmed learning really is has been only one of the reasons why. 
so many educators have been slow to accept it. Another, and perhaps more important, reasonis , 
the shortage of research and experimentation. Psychologists are currently researching all aspects 
of programmed learning; but so far there appear to have been very few significant discoveries. 
Meanwhile, publishers are issuing contracts for programmed material at a rapid rate — one com- 
pany issued 184 contracts this year — with little or no concern for the correct technology of pro- 
gramming. There is no question but that at least 50% of the subjects now taught in schools and 
colleges will be programmed in one form or another before the psychologists have discovered a 
basic program technology. 


As one reads over the manuscripts included in this journal, one is struck by the overwhelming 
reports of no significant difference between the various treatment and response modes. hat is 
the explanation of this? Obviously, most of these studies are too short-range. The programs 
they are working with are too brief, and differences are not maximized. Can behavior eally be 
changed in 90 frames? Researchers should make an effort to produce differences by extreme ma- 
nipulations of variables, even though this might not be practical in the classroom. At least it 
would provide a bench mark as to what is possible for programmed learning. Longer Studies are 
needed. 


_A number of studies have been made comparing programmed and conventional instruction. 
Usually the results of these studies have seemed slightly to favor the program; however, it has 
not always been clear whether the two groups were actually working with the same materials and 
under the same conditions. An unfortunate result of all these comparative studies, too, has been 
that they inevitably suggest that there is a competition between the teacher and the machine — 
that victory for one is defeat for the other. 


In spite of these drawbacks, the research in general suggests that programmed learning has 
а great future. But if our schools аге to make full use of this discovery, there are two things 
уе must do. First, we must see to it that studies of programmed learning are well designed so 
that the results can be of use. And second, we must help teachers to realize that programmed 
learning is a new and exciting tool, not a threat. We must think and speak, not in terms of ‘‘the 


teacher or the machine,” or ‘‘the teacher versus the machine,” but of ‘‘the teacher and the 
machine. as 
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A Comparison of Branching Methods for Programmed Learning 


$ 
% 

IN EARLIER STUDIES 1* with linear programs, 
where each student had to go through all the items 
in a predetermined sequence, we failed to find any 
significant differences in the terminal performance 
of students who had worked with multiple choice 
machines, composed response machines, pro- 
grammed texts requiring overt responses, and 
programmed texts requiring no responses. How- 
ever, significant differences had been obtained in 
the amount of time required to complete the learn- 
ingsession. We therefore decided to investigate 
those methods of presenting programmed material 
which might speed up the learning process without 
adversely affecting terminal performance. 

Many varieties of branching programs have been 
suggested to accomplish this purpose ( backward 
branching to missed items, backward branching to 
review ntire sequence of items, backward 
branched alternate form items, lateral branch- 
ing to supplemental or prerequisite material, 
branching down to more detailed items for slow 
students, branching up to faster programs for 
bright students, forward branching, etc.), but 
very few experiments? have compared the relative 
merits of the various branching methods. Early 
in the.analysis of the problem, We concluded that 
all types of branching and linear programs are re- 
ducible to one common prototype form (selective 
forward branching), and that the distinction be- 
tweenthe various name-type is more arbitrary 
than is commonly suspected `. Furthermore, by 
casting all of the “different?” types of branching 
procedures we wished to investigate into the pro- 
totype form, we could physically present all 


branching formats in one simple inexpensive device. 


Equipment 


A cardboard box* capable of holding two hundred 
4- by 6-inch cards, each of which is one complete 
item in a program, Was devised to solve the prob- 


REESE  —  —— — 


* А 
Footnotes will be found at the end of the article. 


ARNOLD ROE 
University of California, L.A. 


lem of presenting different kinds of branching pro- 
grams simultaneously to a large number of stu- 
dents in an experimental situation. Since the ex- 
perimental design required а record of each stu- 
dent's responses to individual items inthe program, 
a device which could effectively control student be- 
havior, thereby permitting measurement of time 
during learning, was essential. Cost restrictions 
precluded the use of expensive electro-mechanical 
devices. А scrambled textbook was ruled out be- 
cause of the difficulty of controlling browsing or 
peeking aheadfor correct answers. 

Made of inexpensive railroad board, the box is 
easily assembled by drawing а pattern on the board, 
cutting it out with à razor, then folding and fasten- 
ing it with staples and rubber cement (Figure 1); 

A condensed list of instructions on the use of the 
device is attached to the front of the box. 

Numbered cards, each consisting of one item 
in the program, are arranged like а card file in 
the rear section of the box. A series of colored 
file break cards with numbered tabs divide the 
item cards, which also have numbered tabs, into 
groups for easy access to an individual card. Each 
branching sequence is scrambled in order to pre- 
vent browsing and to conceal the program format. 

Item cards, which are either of the multiple- 
choice or composed -response type, are mimeo- 
graphed. Below each of the answer choices on the 
multiple-choice type, or below the space for writ- 
ing in answers in the composed- response type, is 
the correct answer as well as directions to the 
next card in the sequence. This information ap- 
pears within the bottom inch of the card, an area 
which is covered with cellophane tape. A china 
marking pencil is used over the tape to black out 
the correct answers and directions. To indicate 
his answer, the student removes the black patch 
below his choice, thus giving а permanent record 
of his response. After the student has noted the 
correct answer and the number of the next card, 
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he inserts the card into the front section of the box. 
The mask tab must be raised to allow insertion of 
a card through the slot in the upper left-hand side 
of the box (Figure 2). When the tab is released, 
the mask conceals the entire card except for the 
directions to the next card whichare visible through 
an opening at the top (front) of the box and a cutout 
portion of the mask. From the discarded items in 
this front section, a record of the sequence which 
the student has followed is available to the experi- 
menter. 

Instructions which appear on the first five cards 
of the sequence establish a behavioral pattern of 
working one card at a time, slipping it into the box, 
then selecting the next card in the specified order. 
Thus, not more than one card is out of the box at 
any time. Most students who participated in the 
experiment were able to follow the procedure as 
outlined on the cards without additional verbal in- 
structions. 


Subjects 


The subjects of this experiment were the 189 
students enrolled in the freshman engineering labo- 
ratory course at UCLA during the fall, 1961, se- 
mester. The students were classified into aptitude 
thirds according to their scores on the Mathemati- 
cal Reasoning Score on the UCLA Lower Division 
Engineering Examination (formerly used as a col- 
lege admission examination). Within each aptitude 
third, students were randomly assigned to seven 
groups. The first group (linear) worked through 
each item in the program. Тһе second group (for- 
ward) branched forward if their responses to key 
items indicated a knowledge of the particular con- 
сері being dealt with. Тһе third group (backward) 
Saw each item and at the end of a group of items 
covering a particular concept, if their responses 
indicated they had not mastered that concept, 
branched back to the same Sequence of items a 
second time. Thefourth group (backward alternate) 
followed the same procedure as the third group but 
instead of branching back to a repetition of the 
same items, they were branched to alternate items 
covering the same concept. The fifth group (pre- 
test forward) differed from the forward-branching 
group in that they took the criterion test before 
they had a chance to see the program and then re- 
peated the test at the end of the program. The 
sixth group (random) worked through the same 
items as the linear group, but the ordering of items 
was random rather than sequential. The Seventh 

group (text) was given the forward-branching pro- 
gram in the form of a text book instead of the card 
file. 

In all the branching programs, two key branch- 
ing items were used in preference to a single one 
to test the student's mastery of a concept, inorder 
to reduce the possibility of the student's guessing 


the correct answer and thereby skipping over a se- 
quence of items which he did not really know (Fig- 
ure 3). 


Material 


The program consisted of six introductory and 
instruction items, 93 items covering seven con- 
cepts of elementary probability (relationship be- 
tween information, degree of certainty, and prob- 
ability measure; sum of probabilities, range of 
probabilities; additive law for mutually exclusive 
events; multiplicative law for independent events; 
multiplicative law for dependent events; union of 
events; union of complementary events), and 14 
criterion test items (two for each of the concepts). 
The programmed material was administered first 
to a few individual students, revised, tried in a 
pilot study with 55 students in a similar freghman 
laboratory group during the spring of 1961,9 re- 
vised again, tried on six individual students, and 
revised for the last time prior to the September, 
1961, experiment. 


Procedure 


Approximately eighty students at a time met to- 
gether in a large room during their regular sched- 
uled freshman engineering laboratory class hours 
in the second week of the fall, 1961, semester (see 
Figure 4). The subject content was part of the re- 
quired curriculum and students were informed that 
the final examination at the end of the semester 
would include questions on elementary probability. 

Forward, backward, backward alternate, linear, 
random, and pretest forward programs were used 
by different students in the classroom at the same 
time. However, since the card handling procedure 
with the special boxes was the same for all meth- 
ods, the students were not aware that differentse- 
quences of items were being presented to other 
Students. The group of students who used the for- 
ward branching programmed texts met together in 
а separate room. 

Data was recorded on the learning time, error 
Score during learning, criterion test time, and 
criterion test score. 


Results 


The data were in all cases first tested for in- 
teraction effects between aptitude and time or 
Score by analyses of variance. Since no interac- 
tions were observed, the aptitude scores were 
used as a covariate with the dependent variables, 
and analyses of covariance results are reported 
below. Normality of distribution and homogeneity 
of variance requirements were satisfied in all 
cases. 

In evaluating the results of the experiment, the 


first question posed was whether the programmed 
material contributed significantly to final student 
performance. Comparisons were made between 
(1) pre-learning test scores versus post-learning 
test scores for the pretest forward group, and (2) 
pre-learning test scores of the pretest forward 
group versus post-learning test scores for the 
combined linear, backward, backward alternate 
and forward groups. Tables I and II show a highly 
significant difference in test scores (pre-learning 
mean score: 29. 3%; post-learning mean scores: 
63.8% and 65. 7%) and in test time. 

Overall analysis of covariance on post-learning 
test scores (significant at the 0. 01 level),is shown 
in Table III, and on learning time (significant atthe 
0.0005 level), is shown in Table IV. 

There was no significant difference in the post- 
learning test times between the groups (linear, 
forward, backward, backward alternate, random), 
who used the same equipment, і.е., the branching 
box. 

Of considerable interest was the effect of care- 
ful sequencing Ф program items оп learning. А 
previous study had indicated that for Short pro- 
grams careful sequencing may not be important. 

A comparison of the random and linear groups in 
the experiment, showm in Table V, indicates that 
the random sequence group performed significantly 
worse on learning time, errors made during learn- 
ing, and on post-learning test score. 

Comparison of other pairs of methods indicated 
no significant differences between the linear and 
forward branching methods (Table VI), no signifi- 
cant differences between the backward and back- 
ward alternate methods (Table VII), a significant 
difference only in learning time between linear 
versus combined backward and backward alternate 
methods (Table ҮШ), and similarly a significant 
difference only in learning time between the for- 
ward versus combined backward and backward al- 
ternate methods (Table IX). А comparison of for- 
ward versus the pretest forward groups shows the 
pretest group taking significantly less time for 
learning and for the posttest (Table X). 

A comparison of the groups which had the for- 
ward-branching program in the box versus the 
group which had the forward-branching program 
in the form of a text indicates а significant differ- 
ence only in the learning time (Table XI). 

A comparison of the ‘amount of branching for 
each of the seven concepts is shown in Table хп. 

Furthermore, in the backward branching group 
17% of the students did not miss апу of the decision 
items at the end of each concept and therefore did 
not have to branch back into а review of the items 
in that concept. The worst student had to review 
57% of the items in the program, and, on the aver- 
age, 34% of the items were reviewed, i.e., this 
group saw 1. 34 times as many cards as the linear 
group. Similar results prevailed for the backward 
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alternate group. In the forward group only 7% of 
the students failed to answer correctly on at least 
one of the pairs of decision items at the beginning 
of each concept. The brightest student saw only 
43% of the items, and, on the average, this group 
saw 0.73 times as many cards as the linear group. 
The pretest forward group, on the average, saw 
only half as many cards as the linear group. 


Conclusions 


The program was effective in significantly 
changing the performance level of the students. 
Furthermore, careful sequencing of items has a 
significant effect on student performance, at least 
for programs of some length and complexity. While 
there was a significant difference in learning time 
(but not in test scores), favoring the forward over 
the backward branching procedures, there was no 
significant difference in either learning time or 
test scores between the forward and the linear 
methods. Therefore, the simple branching proce- 
dures tested here do not, by andlarge, seem to be 
more effective than a linear procedure. This, how- 
ever, should not discourage investigation into 
more complex branching procedures, particularly 
those which allow modifications to the branching 
rules or strategies during the course of student 
learning. 
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TABLEI 


PRETEST FORWARD GROUP: PRE-LEARNING VS. POST-LEARNING RESULTS 


N Adj. S.D. MI Vo F Significance 
Mean 

Pre-learning 
test score 29.3 21.3 

20 1 37 27.70 @ >. 0005 
Post-learning 
test score 63.8 21.4 
Pre-learning 
test time 39. 60 9.2 

20 1 37 52.74 @ >. 0005 
Post-learning 
test time 21.65 6.5 

TABLE II 


PRETEST FORWARD GROUP: PRE-LEARNING SCORES VS. COMBINED LINEAR, BACKWARD, 
BACKWARD ALTERNATE, AND FORWARD POST-LEARNING SCORES 


N Adj. S.D. Vi Vo F Significance 


PF pre-learning 

test score 20 29.3 21.3 
1 125 66. 82 @ ».0005 
Combined post-learning 

test score 108 65.7 17.0 


TABLE III 


OVERALL ANALYSES OF COVARIANCE: POST-LEARNING TEST SCORES (PER CENT) 


a 


N Adj. S.D. Vi Vo F Significance 
Mean 

Linear 31 69.7 13.2 

Random 32 59.4 18.7 

Forward 26 62.4 17.9 

Pretest forward 20 63.8 21.4 6 178 2.80 е .01 
Backward 27 67.0 17.4 

Backward alternate 24 62.6 19.5 
Programmed text 26 53.2 22.5 


ROE 


TABLE IV 


OVERALL ANALYSES OF COVARIANCE: LEARNING TIME (MINUTES) 


L———————— 


N Adj. S.D. Уі Vo F Significance 
Mean 

Linear 31 118.1 16.1 

Random 32 131.2 20.6 

Forward 27 110.2 23.4 

Pretest forward 20 91.9 1011759, 6 181 29.7 @ >. 0005 
Backward 27 135.1 22.9 

Backward alternate 26 135.7 22.4 

Programmed text 26 80.2 19.3 


елау 2050050) Cl кыыс с eee — 


TABLE V 


RANDOM SEQUENCE VS. LINEAR SEQUENCE 


N Mean S.D. Vi Vo F Significance 


Learning Time 


Linear 31 118.1 16.6 1 60 7.6 e 003 
Random 32 131.2 20.6 

Error Score During Learning 
Linear 31 9.97 5.3 1 60 58.7 @ > 0005 
Random 32 21.44 6.8 

Test Time 
Linear 31 21.2 4.6 0 € m Ns 
Random 32 26.7 6.3 

Test Scores 
Linear 31 69.7 13.2 1 60 6.9 Ф .% 
Random 32 59.4 18.7 

TABLE VI 


LINEAR У8. FORWARD 


Adj. 
N Mean S.D. Уі Vo F Significance 
Test Score 
“Linear 31 69.7 13.2 1 54 3.3 N.S. 
Forward 26 62.4 17.9 
ing Time 
einen 31 118.1 16.6 1 55 2.2 N.S. 
Forward 27 110.2 23.3 
‘pest Ti 
талын 31 21.2 4.6 1 55 .01 N.S. 
Forward 2 21.3 5.1 i 
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TABLE УП E 


BACKWARD VS. BACKWARD ALTERNATE 


N Mean S.D Уі Vo Ж Significance $ 

Test Score 

Backward 27 67.0 17.4 1 48 .99 N.S. 4 

Backward Alternate 24 62.6 19.5 ] 
Learning Time 

Backward 27 135.1 22.9 1 50 .02 N.S. 

Backward Alternate 26 135.7 22.4 2 
Test Time 

Backward 26 25.5 6.9 1 45 .01 N.S. 

Backward Alternate 22 25.0 5.2 8 


TABLE VIII 


LINEAR VS. COMBINED BACKWARD AND BACKWARD ALTERNATE D 


N Mean S.D. Vi Vo F Significance 
Test Score 
Linear 31 69.7 13.2 qq 1.29 NS. 
Combined Backward 51 64.9 18.4 
Learning Time W 
Linear 31 118.1 16.6 l . 0005 
Combined Backward 53 135.4 22.4 ЕЕЕ AEN, @ > 
Test Time 
Linear 31 27.2 4.6 
Combined Backward 48 25.3 6.1 thee oe Th 4.4 N.S. 
М 
TABLE IX 


FORWARD У8. COMBINED BACKWARD AND BACKWARD ALTERNATE 


N Mean S.D Уі Vo F Significance fi 

„== тш а ES Ee. Sen 

Test Score E 

Combined ва 64.9 184 5 74 0.61 N.S 

Learning Time Ў 
С ааваа 5 135.4 d i т 2.9 (0 >.0005 | 
Test Time k 
Hiat Backward 2 Hu bd : @ 2.2 N.S : 
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TABLE X 


FORWARD VS. PRETEST FORWARD 


N Mean S.D Vi Vo F Significance 
___ _ o a 

Test Score 

Жаны... 20 бі 24 } % 0:64 N.S. 
Learning Time 

arora POR 20 ТЕ 2 i > 1 44 8.6 @ .004 
Test Time 

Ha X | : AH 5 44 9.0 e ‚005 
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TABLE XI 


FORWARD-BRANCHING PROGRAM IN BOX vs. IN TEST 


Re 


тк лал аиы Ол 
ТЕ 
N Mean S.D. Vi Vo F Significance 
Test Score 
Box 26 62.4 17.9 1 49 2.87 N.S. 
Text 26 53.2 22.5 
Learning Time 
Box 27 110.2 23.3 1 50 21.3 @ 7.0005 
Text 26 80.2 19.3 
Test Time 
Box 27 27.3 5.7 1 50 0. 68 N.S. 
Text 26 30.2 16.9 


TABLE XII 


PERCENTAGE OF STUDENTS BRANCHING AT EACH PROGRAM CONCEPT 


Concept 
1 2 3 4 5 6 7 
Backward 28 34 17 4l 10 21 17 
Backward Alternate 32 28 16 40 24 24 20 
Forward 4T 11 10 20 23 11 17 
Pretest Forward 80 30 90 15 35 n 10 35 
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Freshman students in the Department of Engineering, UCL, A) абы a 
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The Teaching of Elementary Statistics by the Conventional 


Classroom Method Versus the Method of 


Programmed Instruction 


Background 


In view of the growing interest in teaching ma- 
chines and programmed instruction, itis important 
that the effectiveness of this teaching technique be 
tested by a large number and variety of well con- 
trolled experiments. A study was accomplished 
at the United States Air Force Academy during De- 
cember 1961, which should help to establish the 
position of programmed instruction in the educa- 
tional community. 

The main effort of this experiment was directed 
toward a comparison of the effects of two methods 
of instruction (treatments). The subjects were 128 
freshman cadets at the United States Air Force 
Academy who were considered typical of those pur- 
suing the prescribed course in mathematics. The 
128 cadets were divided into two groups by à meth- 
od which is explained under experimental design. 

A control group was taught a short course in 
statistics by the ‘‘conventional method.’’ This re- 
fers to the procedure ordinarily used in conducting 
classes in mathematics at the college level, which 
consists of the use of a textbook, study outside the 
classroom, and regular classroom meetings. Dur- 
ing these meetings, the instructor lectures, the 
theory and homework problems are discussed, and 
examples of the material being studied are worked 
under the supervision of the teacher. 

The experimental group was taught the same 
material by programmed instruction. This was 
accomplished by giving each с adet a teaching 
machine in the form of a scrambled book and in- 
structions on how it was to be used. 

Both treatment groups were required to attend 
classes five days each week from eleven to twelve 
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o’clock; however, the experimental group was 
given no formal instruction. An instructor was 
present in each room used by the experimental 
group but was available for individual assistance 
only. Those cadets in the experimental group de- 
pended entirely on the scrambled book, individual 
consultation with the teacher, and any independent 
study they wished to undertake. They were per- 
mitted to work at their own speed but were re- 
quired to proceed through the program at the rate 
of at least one lesson per class meeting. The 
course consisted of twelve lessons plus a one- 
hour final test which was administered immediate- 
ly after the completion of the course. 


The Program 


The control group studied statistics from a 
booklet prepared by the Department of Mathemat- 
ics at the Air Force Academy. To insure that the 
experimental group would study the same materi- 
al, it was considered necessary to program this 
pooklet. The programming was done by the au- 
thor. The intrinsic programming technique was 
employed (3) and the final program was prepared 
in the form of a scrambled book of 337 pages. 


Experimental Design 


A “treatments by levels" experimental design 
was employed (4). Two treatments were applied, 
the conventional method of teaching and pro- 
grammed instruction. 

Four ability levels were established. This 
was accomplished by a ranking of the subjects, 
which was based on their achievement in mathe- 
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matics during the first two-thirds of the semester. 
The top thirty-two cadets were placed in level one, 
the second thirty-two in level two, etc. Within 
levels, sixteen cadets were assigned to each of the 
treatment groups by a random process, thus form- 
ing eight cells. 

A different instructor was assigned at random 
to each of the eight cells. In an attempt to mini- 
mize further the instructor variable, instructors 
were reassigned, again randomly, after lesson six. 

Two sets of data were collected, individual 
scores on the final test (achievement criterion) and 
the time used by each subject to complete the 
course (time criterion). Analysis of variance was 
used to study each of the two sets of criterion 
measures obtained. The experimental design al- 
lows one to test both the hypothesis that there is 
no difference in the overall effect of the two treat- 
ments (main effect) and the hypothesis that there 
is no difference in the effect of the two treatments 
at different levels of ability (simple effect). The 
F-test was employed to test the hypotheses. 


Data Collection 


The final test was prepared by the author and 
submitted to several members of the Department 
of Mathematics for evaluation. A problem-solving 
test was decided upon. Following the administra- 
tion of the test at the completion of the course, the 
reliability was computed using Cronbach’s coeffi- 
cient alpha (1) and adjusted by the Spearman- 
Brown Formula (2). The value of alpha was 0. 82. 

Data on time required to complete the course 
were obtained by requiring each cadet to maintain 
a daily record of the time he devoted to the study 
of statistics, both during formal class meetings 
and outside of class. 

Two questionnaires were completed at the end 
of the course. One was designed to measure in- 
terest in statistics and the other to measure atti- 
tude toward programmed instruction. 


Analysis of the Data 


The final test scores were analyzed into the 
sums of squares presented in Table I. From 
these data, the F for the main effect was 


55 
HE \ 
F= + mE „сот. 


The Е for the simple effect was 


Based on these results, the following null hypo- 
theses regarding achievement could NOT be re- 


jected at the ten percent level of significance: 


(1) No statistically significant difference, 
ascribable to differences in the methods 
of instruction, exists between overall 
achievement of the two theoretical pop- 
ulations from which the treatment 
groups were assumed to have been 
drawn. 

(2) No statistically significant differences, 
ascribable to differences in the meth- 
ods of instruction, exist in achievement 
at any of the four ability levels of the 
two theoretical populations from which 
the treatment groups were assumed to 
have been drawn. 

The amounts of time consumed by the subjects 
in completing the course were analyzed into the 
sums of squares presented in Table П. Time was 
measured in minutes. From these data, the F 
for the main effect was 


58 55 
F =A + тў = 129.4, 


Thus, the following null hypothesis regarding time 
consumed in completing the course COULD be re- 
jected at the 0. 1 percent level of significance: 


No statistically significant difference, as- 
cribable to differences in the methods of in- 
Struction, exists between the average time 
consumed in completing the course by the 
two theoretical populations from which the 
treatment groups were assumed to have 
been drawn. 


From the data of Table II, the F for the simple 
effect was 


55 55, 
ЖАСА ИН са 
Fe + ag = 217. 


Based on this value, the following null hypothesis 
regarding the time consumed in completing the 
course could NOT be rejected at the five percent 
level of significance, but it COULD be rejected at 
the ten percent level: 


No statistically significant differences, а8- 
cribable to differences in the methods of 
instruction, exist in time consumed in 
completing the course at any of the four 
ability levels of the two theoretical popula- 
tions from which the treatment groups 
were assumed to have been drawn. 


This result indicates an area needing further in- 
vestigation. 

The mean time used to complete the course at 
each of the four ability levels is given in Table III. 
This table indicates that rejection of the hypothe- 
Sis regarding time used to complete the course i$ 


due to a greater consumption of time by the con- 
trol group than by the experimental group. The 
average time used by the experimental group was 
sixty-six percent of that used by the control group. 

A point biserial correlation was computed be- 
tween scores obtained from the experimental and 
control groups on the questionnaire, which was de- 
signed to measure interest in statistics. The co- 
efficient was found to be 0. 05, thus it is NOT pos- 
sible to conclude that one method stimulates more 
interest in the subject than the other. 

Some results of the questionnaire designed to 
measure attitude toward programmed instruction 
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ently. This is an area in which further research 
is needed. 

The students being taught by programmed in- 
struction did, on the whole, respond favorably to 
this method of instruction, did consider it more 
efficient, and did feel that they had more opportu- 
nity to receive individual assistance from the 
teacher than under more conventional methods. 
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TABLE I 


SUMS OF SQUARES FOR FINAL TEST SCORES 


Sources of Variation 


Treatments (A) 

Levels (L) 

Between Cells (C) 
Treatments by Levels (AL) 
Within Cells (W) 


Total (T) 


Degrees of Freedom 


Sums of Squares 


55 д = 16. 53 

SS; = 10, 238. 10 
550 = 10, 643. 53 
SSAL - 388.90 


EE = 27,401. 69 
SS - 38,045. 22 
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TABLE II 
SUMS OF SQUARES FOR TIME CONSUMED IN COMPLETING THE COURSE 


к _———ЄЄ—Є—Є—Є——Є—ЄЄ—Є—ЄЄ—ЄЄ—ЄЄ—Є——————Є- 


Sources of Variation Degrees of Freedom Sums of Squares 
ee 
Treatments (A) 1 55, - 4,625, 742 
Levels (L) 3 55ү, = 1,808, 783 
Between Cells (C) " SS, = 6,667, 572 
Treatments by Levels (AL) 3 SS. = 233,047 
Within Cells (W) 119 SSy = 4,252, 581 

Total (T) 127 SS, = 10,920, 153 
mu Сг ee 
TABLE III 


TIME CONSUMED IN COMPLETING THE COURSE 


=... 


Ability Level Time Consumed (in minutes) 
Experimental Group Control Group 
1 698 982 
2 568 997 
3 819 1,132 
4 840 1,335 
Mean 738 1,117 
TABLE IV 


REPLIES TO QUESTIONS INDICATING ATTITUDE TOWARD PROGRAMMED INSTRUCTION 


1: 
Question 02570 MN 
Yes Undecided No 


1. Did you enjoy taking this course by 50 6 8 
programmed instruction? 


2. Do you prefer programmed instruction 38 10 16 
to conventional classroom instruction? 


3. Do you feel that you learned with less 37 10 17 
effort by programmed instruction than 
by conventional classroom instruction? 


4. Do you feel that you have more оррог- 42 4 18 
tunities to get individual questions 
answered in a course taught by pro- 
grammed instruction than you do in a 
course taught in the conventional manner? 
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A Comparison of Overt and Covert Response in 


Programmed Learning * 


THE IMPORTANCE and function of overt re- 
sponse in learning has been debated by both learn- 
ing theorists and educators (Spence, 1951; and 
Dewey, 1959). Progress toward the solution of 
this problem under conditions that would satisfy 
both the theorist and the educator has been difficult 
up to the present time since rigorous controls can- 
not be achieved in the ordinary classroom. Pro- 
grammed self-instruction, however, provides lab- 
oratory-like conditions for the study of learning 
within an educational context and with meaningful 
educational materials. The present study was con- 
ducted to determine the relative effectiveness of 
overt and covert practice during learning when 
some basic concepts of descriptive statistics were 
taught by programmed self-instruction using acut- 
back page, linear programmed text. (See Stolurow, 
1961, for a description of the format. )** 


The Theoretical Issue 


The disagreement about the role of the overt 
response is a basic and persistent one among 
learning theorists. It separates those who identify 
with S-S theory from those who identity with an S- 
R theory. Тһе 5-5 theorists take the position that 
overt response is unimportant and place their em- 
phasis upon perceptual organization and central 
processes. 

Much earlier, Thorndike (1913) had pointed out 
а basic deficiency in the 5-5 approach. A preva- 
lent misconception among psychologists at the 
time of his APA presidential address was А 
that an idea tends to produce the act which it re- 
presents or resembles or is “ап idea of,’ or ‘has 
as its object.’ " He goes on to say feis . the 
gist of my contention is that any idea or other sit- 
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uation tends to produce the response which hered- 
ity has connected with it or which has gone with it 
or some part of it with a satisfying or indifferent 
resulting state of affairs. An idea has no power 
to produce an act save the power of physiological 
connections born in man, or bred in him as the 
consequence of use, disuse, satisfaction and dis- 
comfort.” Guthrie (1935), much later, in com- 
menting upon Tolman's position makes the same 
point in a slightly different way. 


‘Signs, іп Tolman’s theory, occasion 
in the rat realization, or cognition, or 
judgment, or hypothesis, or abstrac- 
tion, but they do not occasion action. 
In his concern with what goes on in the 
rat's mind, Tolman has neglected to 
predict what the rat will do. So far as 
the theory is concerned the rat is left 
buried in thought; if he gets to the food- 
box at the end that is his concern, not 
the concern of the theory. ”” 


For some reason, the problem of relating stim- 
ulus to action has not been perceived as the impor- 
tant problem for 8-5 theory; rather the problem 
has been one of accounting for the gap between 
stimulus and knowledge. 

The gap between Stimulus and response is а 
central concern in a theory of behavior that would 
relate to educational needs. To use Tolman's ter- 
minology, the problem for education is not simply 
one of getting the student to learn sign-significate 
relations, but rather it is also one of getting the 
student to learn the movement patterns (speaking, 
writing, spelling, typing, dancing, etc.), that re- 
late those associations within him to the external 


is (1961) of the first author, supervised by the second. 
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world and thus provide communication and records. 


We do not want to leave the student buried in 
thought. 

S-R theory, in this sense, has taken a broader 
mandate in that it is an attempt to account for a 
wider gap, ог, in fact, for two gaps. Hull's 
classical papers of the thirties not only made a 
case for the overt response in simple learning, 
but also employed the overt response as the basic 
material from which central mediation processes 
are fashioned. Osgood (1953) formalized and ex- 
tended Hull's basic position to language behavior 
апа linguistic mediaton. Mowrer (1960) has ге- 
cently extended this type of S-R analysis to virtu- 
ally all symbolic and cognitive processes. All S- 
R mediation theorists fashion the mediating pro- 
cess out of overt behavior and thereby make the 
overt response a critical and pivotal notion. 


Educational Significance of Overt Response 


The role of the overt response in terms of edu- 
cational development has probably had its clearest 
and most emphatic proponent in Dewey. As early 
as 1897 Dewey wrote: 


“I believe that the active side precedes the 
passive in the development of the child na- 
ture; . . . , I believe that consciousness 
is essentially motor or impulsive, that 
conscious states tend to project themselves 
in action. 

“I believe that the neglect of this principle 
is the cause of a large part of the waste of 
time and strength in school work. The 
child is thrown into a passive, receptive, 
ог absorbing attitude. The conditions are 
such that he is not permitted to follow the 
law of his nature: the result is friction and 
waste." (Dewey, 1959, p. 28.) 


Dewey's views were widely accepted and inter- 
preted to mean that the more active the learner 
was during practice the better would be the learn- 
ing that resulted. Furthermore, efforts have been 
made to obtain classroom data in support of this 
position. Gates (1917), for example, found that 
active recitation was particularly beneficial in 
memorizing meaningful material. More recently, 
studies by Lumsdaine and Gladstone (May and 
Lumsdaine, 1958); Lumsdaine, May and Hadsell 
(May and Lumsdaine, 1958, Ch. 6); Kimble and 

Wulff (1953) and Michael and Maccoby (1953) have 
supported the efficacy (some with significant dif- 
ferences, others with trends), of active overt 
practice. 

The greater effectiveness of overt over covert 
practice is not a consistent finding, however. For 
example, Ash and Carlton (1953), Jaspen (1950), 
Kurtz, Walter and Brenner (1950) and Roshal 


(1950) found that various ‘‘participation’’ exercie 
which were required during a film presentation jj 
terfered with the learning that resulted. Studies 
by Michael (1951) and von Wright (1958) produce 
Similar results. 

Thus, in spite of the prevailing educational vie 
we find the experimental evidence inconsistent, 
Some studies indicate that activity can facilitate" 
the learning process. Other data, however, in 
cate that activity can distract the learner or inte 
fere with the learning process. Taken together 
they suggest the existence of an interaction effeet 
overt activity during practice sometimes aids and 
sometimes interferes with learning. Consequei 
active responding can not be indiscriminately ree 
ommended; it is necessary to consider the task t 
be learned and the learner. 

Another question that arises is the degree to 
which one can generalize from these findings of 
the studies which seemed to favor covert prac 
in view of the differences that exist between lea 
ing in an experimenter-paced situation from se 
paced instruction, where the learner determines 
the rate at which the new information is presente 
to him. It seemed, therefore, that it would реш i 
ful to determine the degree to which these finding} 
would apply to self-paced instruction. 7 


Temporal Contiguity and Learning Efficiency 


It might be useful to re-examine learning situa 
tions to determine if some basic condition is cor 
related with the method and materials of instruc: 
tion, determining the efficiency of the learning 
that results. It is possible that overt vs. covert- 
response is not the real problem. For example, 
it simply takes longer to make an overt response 
than to make а covert one. Thus, with covert re 
sponses the interval between the stimulus-responst 
connection and the interval between one cue-stim= 
ulus and the next are shorter. On the other hand, 
the effectiveness of overt practice may lie in the 
degree to which the overt response is cued by the, 
otherwise practiced covert response. If the stu- 
dent who can think a word can write it correctly, | 
there already exists an established cue-response 
relationship between thought and action and it 5 
would presumably make little difference whether і 
he was required to make a covert or an overt re- 
Sponse in the course of learning. On the other 
hand, if the student had not learned this relation- 
Ship then it might make a great deal of difference. 
if he practiced an overt or a covert response, for 
in practicing the overt one he would be building up 
the required relationship between saying the word 
to himself and writing it. This problem is clear 
est in the learning of a foreign language when yO 
learn to speak the language and then learn to read 
it. If the spelling is not consistent in its phonetic: 
orthographic relationships, the student is in for 
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surprises, е.6., spelling awful and ought. 

This analysis suggests that two factors might be 
operating to produce quite different results from 
the use of covert practice. One factor is contiguity 
ortemporal compression; the other the above - 
mentioned relationship between thought and action. 
Provided the cues produced by the covert response 
were already associated with the desired overt 
response, learning would be more efficient by 
practicing the covert response, since it wouldhave 
а shorter latency. On the other hand, if these con- 
nections were not well established, then practicing 
only the covert response would be less effective. 

It would seem that these factors could determine 
the relative effectiveness of overt and covert prac- 
tice in learning from materials presented in a pro- 
grammed self-instructional text. To the extent 
that the primary task is one of associating cues 
with covert responses that already are linked to 
overt responses, then students should learn more 
efficiently from practice that requires only the 
covert response. 


Purpose 


In this study we were primarily concerned with 
determining whether this argument was useful. 
Therefore, the study was one of the relative effec- 
tiveness of overt and covert responses during 
practice when it could be assumed that the covert 
responses practiced by one group were relatively 
well established cues for the conventional overt 
response that the other group practiced. 

А second independent variable was the order in 
which the retention tests were made. This vari- 
able was included to check on an earlier finding by 
Postman, et al. (1948), that two types of retention 
tests did not produce independent measures of re- 
tention but rather each was affected by the other. 
Using a nonsense syllable learning task, they found 
that recognition was poorer if measured after re- 
call than if measured before it, and that recall was 
better if measured after recognition than if it was 
measured before it. 


Method 


Dependent Variables 


Four dependent variables were used in this 
Study. Three were performance measures in the 
form of paper-and-pencil tests presumed to reflect 
learning or retention of the subject matter covered 
in the program. The fourth variable was the time 


required by the subject (S) to complete the program. 


Tests and test scoring. Tests given before the 
programmed learning session are referred to as 
pretests, tests given less than one day after the 
learning session are referred to as immediate 


post-tests, and tests following the learning ses- 
sion by two weeks or more are called retention 
tests. The tests differ in two ways. They differ 
in format so as to allow the manipulation of order, 
the second independent variable described above, 
and they differ in the type of items they contain. 
Опе type of item is designed to measure the а850- 
ciation of a predicate with a subject of a sentence. 
The other type of item is designed to access any 
differential effects of the independent variables on 
**association reversal learning’’ (Stolurow, 1961; 
Stolurow, Hodgson and Silva, 1956). Forexample, 
if the S were taught that the mean is a measure of 
central tendency then that is the ‘ *forward'' direc- 
tion of association. In testing for association ге- 
versal we would determine if 5 could answer the 
question, “А measure of central tendency is the 

." The test presumed to meas- 
ure association reversal learning is referred to 
as the AR test. 

The other two tests are presumed to measure 
practiced response learning. They differ in for- 
mat so as to allow recall and recognition respond- 
ing. The recall test is a “fill-in” test. Тһе 
recognition test is а multiple choice test. These 
two tests are denoted FI and MC, respectively. 

FI’ and MC’ are used to designate slightly differ- 
ent forms of the tests used in a pilot study. The 
total number of correct responses possible for 
AR is 14, for FI, 18 and for MC, 15. 

Since the main concern of the study was the 
evaluation of the relative effects of overt and 
covert responses on learning equivalent, forms of 
the tests were not constructed for each testing 
session. Each 8 took the same form of every test 
on each occasion that he was tested. 

The basic criterion used in judging the correct- 
ness of responses was the extent to which the re- 
spondent’s answer was, rather than resembled, 
the response designated as correct by the program. 


Time measures. Time measures were obtained 
at various points in the program by the use of an 
item in the program directing the S to «поќе time 
here ." Several methods of 
displaying the time were tried in the pilot study. 
It appeared that the most workable method was to 
provide one large electric clock for the entire 
groupof Ss, set it at 12:00 and plug it in when the 
group was told to begin work on the program. 58 
were asked to record time correct to the nearest 


minute. 


Design 


Four groups of Ss were employed. Three 
comprise the pilotstudy. The fourth group con- 
stitutes the Ss used in the final study. In each 
group assignments to subgroups were made on à 
random basis. 
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Pilot study. Due to scheduling difficulties and 
time pressure, the pilot study groups each followed 
routines differing from that used in the final study. 
Each pilot study group worked through theprogram; 
the routines they followed omit one or more of the 
testing sessions used in the final study. Invervals 
between sessions and specific tests administered 
also varied. 


Final study. Three separate sessions were 
uséd. The first session was a testing session used 
for the administration of the three learning tests. 
The second session, held a week after the first, 
was both an experimental and a testing session. 
Directly after working through the program ( a 
Short break was optional for the S), each S took 
the tests again. Two weeks after this session the 
tests were readministered. Order of test admin- 
istration was held constant for the first two testing 
sessions and manipulated only during the retention 
test session. 


Materials 


Content. Several characteristics were consid- 
ered in determining the material to be used. Three 
requirements appeared most important. First, the 
program had to be one with which either covert or 
оуегі responses could be made. Second, for moti- 
vational purposes it was desirable that the mate- 
rial covered in the program be fairly relevant to 
the Ss' interests. Third, the material could not be 
impossibly difficult for the average S, nor so fa- 
miliar or accessible that gains or losses in crite- 
rion scores could be due primarily to non-experi- 
mental factors. 

It was felt that the introductory unit, Unit I, 
Írom the program Descriptive Statistics met these 
requirements reasonably well. 

In content, Unit I is primarily given to defining 
elementary concepts useful in descriptive statis- 
tics. Responses required of the S are, largely, 
written verbal responses, with some discrimina- 
tion responses (S is to pick out a correct instance 
of a concept) and a few simple calculations (e. g., 

S is to read a graph). 

Unit I is a fixed-sequence or linear program. 
There is no provision within it for varying the or- 
der of the material presented to S according to his 
pattern of successes or failures. Each S follows 
the same sequence, unless he retraces ог skips 
ahead on his own initiative. In practice, skipping 
or retracing is rather awkward and it is probable 
that most Ss followed the same Sequence in work- 
ing through the material. 


Format. Because of the expense of the com - 
mercially available program, permission was ор- 
tained to reproduce the material in mimeographed 
form. (See ** footnote on first page. ) 


The format used was Stolurow's (1961) cutback 
page programmed booklet with separate answer 
Sheets. In this form the program allows § a Self- 
paced progression through the material, provides 
him immediate knowledge of results, and furnishes 
a record of responses made by the overt response 
group. 


Procedures 


Procedures followed at each session were very 
much the same for both the pilot and the final study, 
The first session was opened with a few remarks 
by E on the general nature of the experiment. §s 
were informed that the object of the experiment 
was the evaluation of the teaching device, not an 
evaluation of themselves. No reference was made 
to the variables actually employed. 

The programmed learning session was preceded 
by instruction in the use of the cutback booklet. 
Following this instruction, which did not make use 
of the booklet itself, the group was divided into 
overt and covert response subgroups and the covert 
response subgroup asked to meet in a different 
room. 

The overt response subgroup was instructed 
that it was unnecessary to correct mistaken an- 
Swers on the response sheet. After a pause to al- 
low answering of questions, they were told to be- 
gin work and the clock was started. The covert 
response subgroup was informed that they were to 
work as previously instructed except that they were 
not to write out the answers to the questions but 
were only to *'think'' them. This subgroup was 
Started in the same manner as the other. Both 
Were encouraged to work at a comfortably brisk 
pace. On completing the program each S was giv- 
еп a set of tests which he worked in the same 
room, after an optional ‘‘break. ” ^ 

The entire group of Ss took pre- and retention 
tests in the same room. Overt and covert response 
Subgroups were sent into separate rooms for the 
experimental treatment and the immediate post- 
test to control the possibly disturbing motivational 
factors expected to be present if both groups 
worked through the program together. 

Ss were instructed to work the tests at a com- 
fortable pace, to try to answer all questions, and 
to proceed through the pages in order withoutlook- 
ing back at previous pages. All Ss took their pre- 
and immediate post-tests in the sàme order: АВ, 
FI, MC (Order2). For the retention test session, 
overt and covert groups were divided into two sub- 
groups, one subgroup in each response group tak- 
ing the tests in Order 2 again and the other sub- 
group now taking the tests in Order 1 (AR, MC, Fl) 


Subjects 


Pilot study. Group 1 was composed of Ss from 
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the University of Illinois 1960 summer semester 
introductory psychology course. There was an 
approximately equal number of males and females 
inthis group. Group 2 was composed of elemen- 
tary school teachers and was predominantly fe - 
male. Group 3 was a group of University of Illi- 
nois education majors. In this group males and 
females were roughly equal in number. 


Final study. The final study drew on the Uni- 
versity of Illinois Psychology 100, 1961 spring se- 
mester introductory psychology course. There 
were no females in this group. Fifty-one Ss com- 
pleted all three sessions. Five Ss present] previ- 


ously failed to appear for the retention test session. 


Results 


Pilot Study 


The pilot study results were used in three ways. 
First, they were used in an informal way to verify 
the capability of the program to teach its subject 
matter, and the capacity of the test to reflect such 
learning. Second, they were used to make a pre- 
liminary assessment of the effects of the independ- 
ent variables. Third, they were used to estimate 
sample sizes for the final study. 


Gain scores. The schedules followed by the 
groups of the pilot study make impossible a statis- 
tically precise test of the gains from pretest to 
immediate post-test which uses all relevant data. 
However, taking conservative estimates of gain 
and variance (smallest observed gain and largest 
observed variances), from both Group 2 and Group 
3, and using . 50 as the correlation between pre- 
and immediate post-tests, the t for gains on both 
FI’ and МС? were significant af the . 05 level. 

Since no control group is available for compar- 
ison, it could be argued that these gains are due 
to the practice provided by subsequent administra- 
tions of the same test form. However, on the 
whole, items used in these tests sample content. 
Very little in the way of reasoning is demanded of 
the S, and no knowledge of results is provided be- 
yond the indefinite help given by the multiple 
choice format (the MC’ test was given last in each 
testing session prior to the retention session to 
make it impossible that this aid should be reflected 
іп AR and ЕІ scores). Thus there is good reason 
for taking the gain scores as representing learning 
due to the program alone, to an interaction of 
tests and program or, perhaps, both. 


Independent variables. The standard t test of 
differences between uncorrelated means indicated 
that there were no reliable differences between 
overt and covert treatment test means of the same 


Group (at the . 05 level of significance), before or 
after the experimental session. 

Significant differences were found in program 
completion times. In each Group the overt re- 
sponse subgroup took longer to complete the pro- 
gram on the average than the covert response sub- 
group — Group 1 took 67.0 minutes for overt and 
53.3 for covert, for Group 2 the times were 13.1 
and 44.1, respectively, and for Group 3, 54.5 and 
35.9, respectively. The standard t test indicated 
a significant difference in time тейпв for Group 1 
and 3 (2. 92 and 4.06, significant at the . 05 level). 
In Group 2 the usual t test assumption of equality 
of population variance was not tenable at the .05 
level, largely due to one very deviant time score 
in the overt response subgroup, and the appropri- 
ate t test (Walker and Lev, 1953, р. 157), indi- 
cated no reliable difference in the two means. 

There were no reliable effects of recall-recog- 
nition test order administration on test scores. 

An analysis of variance (Lindquist, 1953) also 
showed no significant order effect. It did, how- 
ever, show a significant effect for '' Test. ” MC’ 
Scores were оп the average significantly higher 
than FI’, despite the fact that FI’ contains 18 items 
and MC’ only 15, in partial confirmation of Post- 
man, et al. ’s (1948) results. 


Final Study 


Main effects. Main (disregarding order) effects 
for response modality are given in Table I. The 
findings are consistent with those of the pilot study. 
Using the t test there are no reliable differences 
at the . 05 Tevel of significance between overt and 
covert response group test scores preceding or 
following the experimental session (see Table I). 
Ав before, only program completion times differ 
significantly. 

Dividing each S's gain from pretest to retention 
test by his program completion time yields the ef- 
ficiency assessments summarized in Table II. The 
covert response condition is the more efficient 
practice method in FI, but mean efficiencies do not 
differ significantly in AR and MC. 

Recall-recognition test administration order 
effects did not differ reliably (at the . 05 level) 
from one another within response modalities be- 
fore or after the experimental treatment. 

A three-way analysis of retention test variance- 
Response Modality x Order x Test, similar to that 
used in the pilot study - was also made (Lindquist, 
1953). The only significant effect (F = 14.49, sig- 
nificant at the . 05 level), was for Test, in agree- 
ment with the pilot study. The data analyzed in 
this analysis of variance were obtained by random- 
ly rejecting Ss from each treatment until 10 Ss in 
each treatment were obtained. 


Correlations. Intercorrelations of test scores 
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did not show any reliable effect of response modal- 
ity. There were no differences between corre- 
sponding correlations of overt and covert response 
groups significant at the . 05 level. 


Analysis of covariance. AnIBM 650 was used 
to perform an analysis of covariance in which re- 
tention test, immediate post-test, and time scores 
were analyzed using adjustments from regression 
planes fitted to the scores of the three pretests. 
Data for all Ss who completed the retention test 
session were used, giving a total N of 51. 

An inspection of error variance for the treat- 
ment groups indicates that error variance can be 
reasonably considered homogeneous. Homogeneity 
of regression was tested for all dependent varia- 
bles, with the result that the hypothesis of homo- 
geneity cannot be rejected at the . 10 level of sig- 
nificance. As would be expected from the magni- 
tude of test intercorrelations which, with one ex- 
ception, were reliably non-zero, the regression 
planes account for a reliably non-zero proportion 
of the total variance of all retention test and im- 
mediate post-test scores. Оп һе other hand, time 
to complete the program cannot be considered pre- 
dictable from pretest scores. 

F tests of the fitted constants for treatments 
and interactions (Kempthorn, 1952), were also 
made. The only variable showing variance reli- 
ably associated with an experimental treatment 
was time to complete the program. 


Discussion 


The Educational Problem 


Response modality. Main effects for response 
m ity have been considered separately from 
the analysis of covariance because of their special 
relevance to the practical aspect of the overt- 
covert response issue, The effects of order, it 
could be argued, are primarily descriptive of the 
tests manipulated, and do not serve to evaluate the 
capacity of the program to instruct. There are 
quite likely a large number of test sequences which 
could affect criterion scores, but, valuable as such 
findings might be for some purposes, from a point 
of view that prefers a teaching instrument which 
produces a non-test-specific gain in competence, 
the desired evaluation of the program should be 
based on data averaged across order. Тһе dis- 
tinction need not be labored here, however, for no 
effect is found for order and the response modality 
main effects are nowhere contradicted by the anal- 
ysis of covariance. ) 

For raw Scores the conclusions аге also homo- 
geneous across variables. No treatments had dif- 
ferential effects on association reversal learning 
as compared to practiced response learning. In 


the case of efficiency measures, however, covey 
responding is found to be more efficient than oye 
responding only with the fill-in test. An econom 
cal explanation of this finding takes note of the fa 
that the mean gains from pretest to retention te; 
for the fill-in test are in the order of twice the 
gains for both the association reversal andthen 
tiple choice tests, suggesting that covert respo 
ing may become more efficient than overt as a 
function of the gain induced by the program. 
The data indicate that response modality make 
no difference in criterion scores for the verbal. 
task involved in the program used, in a populatig 
from which the study's sample is representatiy 
and where learning is carried to a fairly low de. 
gree of competence. The effect of varying ге-| 
sponse modality under these conditions appears 
only in the total time needed to complete the in= 
Struction. Contrary to the position taken by 
Skinner (1958) the data suggest that learning maj 
be more efficient if the response is not compose 
overtly. 
Apart from the savings in time, the essential 
instructional equality of the two practice conditi 
has implications for the conservation of teaching 
materials. Even where time is no object, the dà 
suggest that the educator might well question the 
wisdom of providing expensive devices solely be 
cause they allow overt responses, and he should: 
alert to the possibility of advantageously modifyin| 
existing devices or changing the procedures rece? 
ommended for them. For example, in some cire 
cumstances he could save on the cost of pro- 
grammed textbooks by simply requiring covert | 
practice of his students, thus allowing the re-ust 
of the books. i 
Some caution is, of course, advisable here. 
The program was read through only once by the 8 
in this study, and their learning and retention 4 
Scores are modest by usual standards. Тһе find 
ings of this study do not necessarily apply to sil 
ations where several rereadings of the progra 
are used, or where Ss are generally motivated 
produce higher end-test scores. However, the! 
planation outlined above in accounting for thegré 
er efficiency of covert practice with the fill-i 
Suggests that the efficiency of covert respond 
would be found to be increased in a situation № 
Ss were more highly motivated, but given the 
amount of practice as Ss in this study. If additio 
al practice beyond the one reading used in this | 
Study were given, von Wright's study (1958) 5 
gests that it might be desirable to use a practi 
Sequence in which the S progresses from Cov! 
to overt practice. T 
One possible difference between overt and 
Covert practice is suggested by the interesting a! 
that the only Ss failing to attend the retention ti 
Session who Rad come to the experimental ses: 
were Ss from the covert response group. ТПО 
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these five Ss’ perception of the experimental task 
is unknown, it is possible that the drop-outs may 
have been due to a decreased feeling of learning or 
“participation, °? suggesting tha. overt and covert 
practice may differ in their effects on motivation. 


Order. The data do not support Postman et al. 
(1948) with respect to an effect of the order of ad- 
ministration of recall and recognition tests. No 
effect of order was found. An analysis of variance 
capable of yielding the same conclusions of Post- 
man et al., confirms only one of their conclusions: 
the difference in difficulty levels between multiple 
choice and fill-in tests. Concluding from these 
findings that order is ineffective where the learn - 
ing task involves meaningful verbal materials but 
not with nonsense syllables is hampered by two 
factors. First, order was manipulated at the re- 
tention session. All Ss had two previous encoun- 
ters with the same tests used in the retention ses- 
sion, and this familiarity may have masked order 
effects. Second, order could not be strictly con- 
trolled under the conditions of test administration 
adopted. Despite explicit instructions to the con- 
trary, some Ss were observed cross-referencing 
from page to page, and possibly from test to test, 
during the retention test session. Conclusions with 
regard to order must therefore remain tentative. 


The Theoretical Issue 


Ostensibly, this study’s findings that overt 
practice did not increase learning or retention 
scores over covert practice is data that could be 
favorable to the position of an S-S learning theory, 
but only in a limited sense. The use of a meaning- 
ful verbal learning task with educated Ss is only 
half the required set of conditions. Such Ss, the 
S-R theorist could argue, already have a repertory 
of symbolic mediating responses originally derived 
from overt practice, and kept at near peak habit 
strength by both generalization from verbal re- 
sponses practiced in the course of daily life and 
occasional direct practice. However, this argu- 
ment does overlook the fact that recombinations of 
responses (in effect, new responses), were learned 
without overt practice. The S-S argument is fur- 
ther strengthened by Cook and Spitzer’s study 
(1960) in which overt practice interfered with | 
learning in a task using nonsense syllables, a sit- 
uation to which the S-R explanation involving à 
pre-existing repertory of responses does notneatly 
apply. However, if we consider that the covert 
response-produced cues are sufficient to effect 
the overt response for these 58, then overt re- 
sponses only slow down the process of association 
by lengthening the stimulus response interval. 
Thus, contiguity differences could explain the 
results. KS 

One characteristic of the program used in this 


study deserves specific comment. It isa low er- 
ror rate program, that is, it is written so that few 
incorrect responses are elicited as the S works 
through it. Therefore, the problem of extinguish- 
ing responses does not become important. This 
may account for the essential equality of overt and 
covert practice (disregarding time of response), 
found in this study, and it has implications for fur- 
ther research. One difference between overt and 
covert practice may very well lie in the ease with. 
which practiced responses can be extinguished 
(Stolurow, 1961). If this is so, the experimental 
tasks for overt and covert responding Ss did not 
differ in this study with respect to a potentially 
distinguishing characteristic. This line of reason- 
ing suggests that error rate be considered an inde- 
pendent variable worth examination, and that per- 
sistence of wrong responses be considered as a 
dependent variable in future investigations of the 
overt-covert response issue involving teaching 
machine programs. This could be implemented, 
for example, by the method used by Kaess and 
Zeaman (1960). 


Summary 


The question of the difference between overt 
and covert responding is an issue having relevance 
not only to a distinction between S-S and S-R 
learning theories, but also to practical questions 
of efficient methods of instruction, particularly 
as related to Dewey's views. 

To gather data bearing on the overt-covert is- 
sue two groups, a total of 56 subjects, worked 
through a standard programmed learning text, one 
group responding overtly (writing), the other cov- 
ertly (*«thinking"). Knowledge of the material was 
assessed prior to the experimental session, im- 
mediately after the session, and after a retention 
interval of two weeks. Individual times required 
to complete the programmed text were recorded. 

An analysis of the covariance of immediate 
post-test and retention test measures, adjusting 
for differences in pretest measures, Was per- 
formed. No reliable differences in learning or 
retention measures were found. Mean times to 
complete the program are, however significantly 
different, the covert response group requiring 
less time than the overt response group. 

The data suggest that learning, under the con- 
ditions prevailing in this study, is more efficient 
with a covert response practice method. Impli- 
cations were stated for 5-5 and S-R learning the- 
ories; currently accepted notions about the gen- 
eral superiority of “composed”’ responses; and 
the use of a teaching machine. Some specific 
suggestions for further research were also pre- 


sented. 
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TABLE I 


FINAL STUDY: OVERT-COVERT MAIN EFFECTS, TEST AND PROGRAM COMPLETION 


TIME MEANS* AND t TESTS” FOR RESPONSE MODE 


Response % Pretest Imm. Post-test Retention Test Program 
Mode AR КІ МС AR FI MC AR FI M Time 
c 
Overt 4.59 4.15 8.44 8.19 12.63 12.48 1.22 10.22 11.96 53.30 
Covert? 5.07 3.69 8.24 7.83 12. 03 12.48 7.46 10.25 11.42 38.76 
t -.55 .03 .21 .44 .82 .00 132  -.03 .80 6.00* 


ЕН ҙе 


ж Significant at the . 05 level. 
E Means in number of correct responses, time in minutes. 


b ts аге based on overt vs. covert for each variable. 


€ Ns: overt, 27; covert, 29, except on retention tests, 24. 


TABLE П 


FINAL STUDY: OVERT-COVERT MAIN EFFECTS MEAN GAINS FROM PRETEST 
MEAN EFFICIENCY MEASURES, * AND t TEST RESULTS? 


TO RETENTION TEST, 


Response Gain Efficiency 
Mode AR Е MC R 
Overt? 


Covert? 


le 


* Significant at the .05 level. 
а Efficiency measures are calculated by dividing individual gains from pretest to 


retention test by individual program completion times. 


b t's are based on overt vs. covert for each test. 


2 Ns: overt, 27; covert, 24. 
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Pictorial and Performance Frames in 
Branching Programmed Instruction’ 


EVER SINCE experimentation with programming 
was initiated several years ago in the Department 
of Instructional Materials, the prime interest has 
been to automate some content instruction at the 
college freshmen level in order to release instruc- 
tors from this increasingly heavy burden and give 
them time for opportunities to work with students 
individually and in small groups at atime when stu- 
dents badly need this personal guidance. As a con- 
tent area, therefore, the instruction of freshmen in 
the use of the University Library was selected. 
Approximately 4000 entering freshmen had previ- 
ously got this instruction by guided tours in groups 
of 50 and by two lectures in groups of 150 students. 
This meant over 100 tours and endlessly repeated 
lectures saddled on every available librarian, who 
also had to correct 12,000 papers covering three 
assignments in the Library. The papers incidental- 

ly were often handed back a month after the assign- 
ment was made when the reinforcement would cer- 
tainly be near zero. 

In case the difference between branching and lin- 
ear programs is not familiar, it should be explained 
that while linear programs usually require all stu- 
dents to go through identical steps, branching pro- 
grams vary the instruction depending on errors 
students make in answering multiple choice ques- 
tions or depending on various other choices they 
can make іп the program. Linear programs use 
consistent positive reinforcement; branching pro- 
grams do not. 

The branching rather than the linear technique 
was used because first of all the instruction had to 
be completely automated. It seemed then, and it 
still seems probable now, that in the linear pro- 
gram with constructed answers, the over-hasty, 
careless, or sly student is more liable to get him- 
self into an impasse requiring personal attention of 
a teacher than in the branching program which im- 
pedes such a student by repetitions or helps him by 
remedial circuits. 

Secondly, the branching technique allowed for a 

much higher degree of individualization of instruc- 


* Supported by NDEA Title VII Research Grant 
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tion since it not only allowed the student to proceed 
at his own rate, as the linear program does, but 
also adjusted the content to the individual student'8- 
ability and knowledge. 1 
Thirdly, multiple choice questions used in (һе 
branching programs made it possible to elicit wrong. 
responses which had already been trained into the? 
student and which it was felt would not be ех- 
guished by mere lack of reinf огсетепі. For exa 
ample, almost all incoming freshmen һауе beet 
trained to “‘ask the librarian" when they аге in any 
difficulty. Such an action by thousands of freshmen 
would be manifestly impractical, a waste of time 
for both students and librarians. This answer 
therefore elicited as a wrong response in order t 
get the opportunity to point out to the student that ii 
would be far more efficient in the end if he learned. 
library procedures for himself. 


Method 


To get the most flexibllity needed for research 
purposes in branching techniques, the items Or 
frames were on two-by-two slides. These were | 
projected from a Sarkes-Tarzian projector which 
had, of course, random access to stimulus frames 
as required by the branching technique--that is, 8j 
slide could be summoned by the student at the key 
board, depending on his responses to the reinforce: 
ing questions. The laboratory facility, called а 
Learning Room, resembled a small preview room 
with the student seated at a key board facing the 
Screen and surrounded by working materials which 
will be described later. 4 

Teaching the skills of using the library is very 
similar to an industrial training problem . 
techniques were therefore used which would ma 
imize transfer to the actual situation on the lib: 
floor, pictorial frames and performance frame 
Pictorial frames included Kodachrome photogr api 
Performance frames required the subject to carr 
out some task before proceeding. | 

All of the instruction frames were half print am 
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half Kodachrome picture in a vertical frame with 
the print above and the picture below. The purpose 
of the pictures was to helpthe students recognize 
on the floor of the library items mentioned in print. 
For example, shelf guides, special dictionary 
stands, location of call numbers, and so forth. 
very large close-ups and graphic devices made it 
possible to call attention to details of such things 
as catalog cards and withdrawal slips. The use of 
pictures was validated by having one random sam- 
ple of freshmen use the programs with pictures and 
another comparable sample without pictures--that 
is, with print only. Preliminary estimates of cur- 
rent research indicate that gains between the pre- 
test and the post-test use of the criterion test from 
the use of pictures will be significant. 
Performance frames required the student to stop 
the program and apply his knowledge to materials 
inthe Learning Room. For example, one frame 
asked him to look ора particular catalog card in а 
card catalog tray on the desk in front of him. The 
multiple choice question on the screen could not be 
answered until the correct cardwasiound. Similar 
performance frames called for looking up words in 
the dictionary, looking up material in encyclopedias, 
in periodical indexes, and some called for finding 
particular books in the tiers of bookshelves with 
which the room is provided. The use of these per- 
formance frames was validated by devising a per- 
formance check-sheet, giving students assignments 
to carry out on the floor of the library, and rating 
them by observation of their performance. On the 
programs which did not have performance frames, 
preliminary statistical estimates indicate a high 
level of significant difference in favor of the pro- 
grams which did include the performance frames. 


Procedure! 


From over a hundred sections of English 101, 
twelve were chosen at random to participate in the 
experiment. Four were assigned at random to learn 
to use the library from the teaching machine only 
(the experimental group), four were to learn the 
same content from a series of lectures (the control 
group), and four were to get ПО instruction in the 
use of the library at all (the zero group). Students 
are normally assigned to A, B, and C classes by 
an English Proficiency Test. Allthree of the a- 
bove research groups contained one А class, two B 
classes, and one C class. Toassure comparability 
between the instruction in the experimental group 
and in the control group, the control group lec 
were given by Mr. Earl Tannenbaum, who partici- 
pated in the programming and structure 
tures on the programmed material. He also us 
in his lectures the Kodachrome slides that were 
produced for the instruction frames. Thus, the 
lectures benifited from the very close examination 
and ordering of content whichthe mac hine pro- 
gramming required. 


All students in all three groups were givena pre- 
test on the content in the first week of school. The 
test was repeated after six weeks when the instruc- 
tion was over. Because there were 62 subjects in 
the experimental group, only one program a week 
could be given. A second form of the test was not 
deemed necessary since the initial scores on the 
pre-test were very low. The reliability of the test 
according to the Kuder-Richardson formula was 85. 
It was impossible to use as a criterion test the only 
nationally published test on library procedure be- 
cause it was much too difficult and many procedures 
in our Morris Library were unique. 

The criterion test Was therefore constructe d 
from the most dependable items from past examina- 
tion questions based on this instruction of freshmen 
and from new items written especially for the study. 
The gains between the pre-test and the post-test 
were the dependent variables of the exper iment. 
Scores were obtained on the E nglish Proficiency 
Test and also on the SCAT, or School and College 
Ability Test. An Esterline- Angus pen recorder 
kept a record of all the choices of each student 
which made possible an item analysis of the pro- 
grams. Considerable information was obtained by 
questionnaires on library sources and expereince 
available to the students from high schools from 
which they originated. Very few students dropped 
out of the experiment during the quarter due to ill- 
ness or other causes. The drop-outs were quite 
equally scattered throughout the twelve classes in= 
volved. 


Results 


Analysis of variance showed no significant dif- 
ference in the gains in achievement scores between 
the machine group and the lecture group. Since the 
hypothesis to be tested was that the machine could 
do as well as the lectures, and since the lectures 
had profited enormously from the progr amming 
technique and from the pictorial slides developed 
for the machines, this result was gratifying. Both 
experimental and control groups were significantly 
petter than the zero group at the . 001 level of con- 
fidence, showing that both methods of instruction 
were worthwhile. To experienced teachers, this 
would seem to be obvious, but many с ollege in- 
structors believe that the best way to teach students 
how to use the library is simply to turn them loose 
in it and let them learn by trial and error. А sub- 
sequent experiment using comparable samples of 
freshmen, sophomores, juniors, and seniors 
showed that not until the end of the sophomore year 
do students acquire on the average the same level 
of knowledge that we taught freshmen in six weeks. 
Furthermore, the spread of score s among the 
sophomores was much wider thanthe scores of our 
instructed freshmen, showing that our instruction 

ovided much better uniformity of knowledge of the 
subject among the students. 
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The branching technique proved its value in that 
some bright or informed students were able to get 
through the individual programs in less thanten 
minutes whereas other students took more than 40 
minutes. Five students out of the 62 in the experi- 
mental group were able to negotiate all the different 
bypasses. Such individualization of instruction 
would not have been possible in linear programs. 

As in other teaching machine ex pe r iments, the 
students in an attitude questionnaire showed a de- 
cided interest in learning from the teaching ma- 
chines. 

A further analysis brought out some interesting 
points. Among the A groups, those with high scores 
in English proficiency, the zero group did just as 
well in gains as the experimental and control groups 
It seems that students with high English proficiency 

Scores are able to use the library without either 
programmed instruction or lectures. 


Discussion 


Undoubtedly, bypasses and re medial circuits, 
especially those based on review questions at the 
end of the program, are the most interesting future 
subjects for investigation. It was found, for exam- 
ple, that when a difficult bypass question was placed 
right in the main line of instruction, the students 
shied away from it because they could see how hard 
it was. When, onthe other hand, they were given 
an opportunity actually to enter the bypass before 
seeing the first difficult question (four were always 
used to minimize guessing), some students, not 
knowing what was meant by a hard question, were 
tempted to enter the bypass and try t he first 
question although they had no chance of passing it 
and were returned immediately to the main line of 
instruction. These results were just the opposite 
from what was predicted. Silberman has suggested 
¿voluntary exits from remedial circuits. Some stu- 
dents in the experiment wished there were volun- 
tary exits from bypasses. That is, during the se- 
quence of four bypass questions, some students for 
some reason changed their minds and wanted to get 
back on the main line and not skipthe instruction. 
Thus, voluntary exits from bypasses willbe used in 
the future. 

Review questions placed at the beginning of a 
program based on a previous program were a nui- 
Sance and a waste of time, because building in the 
necessary remedial circuits made the program slow 

to start. A student eager to begin a new unit of in- 
struction with a new program found himself spend- 
ing considerable time on the previous program. 


A similar research study is now underway using 
audio plus picture and comparing it with this past 
study on print plus picture. Theoretically, using 
an instructional approach using two sense modalities 
simultaneously would be more effective than one 
sense modality employed for both picture and print, 

An important variable uncovered was the rela- 
tionship between the pictures and the printed text in 
each instructionframe. In some cases, the print 
merely called attention to the picture which con- 
tained the essential instruction. For example, one 
of the bypass entry question frames was based en- 
tirely on items in the accompanying p icture. In 
other cases, the print and the picture both provided 
instruct ion. Sometimes the picture merely illus- 
trated the print information without being essential, 
Never, however, was the picture merely embellish- 
ment in the way cuts sometimes are used ina text 
merely to relieve the page of print. An analysis is 
being made of this variable of picture versus print 
in relation to errors made in the programs to 
broach this area of investigation. 


Summary 


Freshmen were taught howto use the university’s 
Morris Library by programmed instruction of the 
branching type equally as well as by lectures im- 
proved by the programming process. The branch- 
ing system allowed some students to complete pro- 
grams in one fourth the time others used. Pictor- 
ial Kodachrome instructionframes and performance 
frames (requiring carrying out tasks during the 
programs) were validated by check-rating students 
or actual assignments in the library. Students us- 
ing pictorial frames were better at the . 01 level 
of confidence and those using performance frames 
at the . 02 level. 
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An Investigation of "Teaching Machine" Variables Using 
Learning Programs in Symbolic Logic * 


BEFORE PROGRAMMED learning can be com- 
pared with current andtraditionalteaching methods, 
a great deal of research willneed to be carriedout 
in order that optimal conditions for programmed 
learning can be specified. Although a number of 
writers have suggested techniques and practices to 
be employed in the writing of programs, these sug- 
gestions arise, for the most part, from generaliza- 
tions from research conducted in contexts other than 
that of programmed learning. The small experi- 
mental literature presently available on program 
characteristics and efficiency may be summarized 
as follows: a) multiple-choice devices which provide 
immediate knowledge of results can be used effec- 

_ tively to supplement regular classroom instruction 
(1,15, 16, 17, 22, 23, 24); b) programmed presenta- 
tions of material, either with or without using а. 
hardware machine, have generally produced better 
criterion performance than non-programmed pre- 
sentations even without the refinem ents which further 
research will bring (12, 13,21); c) the programming 
rule to **use large numbers of small steps’’ seems 
to be substantially upheld (6, 26); and d) results ofa 
study of ‘‘implicit’’ responding, responding with no 
overt spoken or written behavior, cast some doubt 
on the necessity for anovert response at every step 
of the program (6). 

One problem in the new field of programmed in- 
struction is the lack of standardized research tools, 
particularly standardized programs. A standard 
learning program which would teach some clearly 
defined verbal or symbolic behavior and lend itself 
to experimental variation would facilitate research 
considerably. Topics drawn from mathematics or 
logic have muchto recommendthem. The criterion 
behavior, as wellas the stimulior cues in the pres- 
ence of which such behavior is to be produced, can 
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generally be clear ly specified. An additional ad- 
vantage is that a wide range of levels of complexity 
can be chosen. A rationale for the choice of a task 
in symbolic logic, the topic used in the present 
study, is presented in the following section. 

Independent variables of possible relevance to 
programmed learning are numerous, and several 
writers (2,10,19) list suggested variables in some 
detail. One obvious variable, W hich might be 
termed “mode of response” is of particular inter- 
est, since it forms а point of departure between 
Pressey and Skinner, the two chief pioneers in the 
field of automated learning. Pressey (22, 23) con- 
sistently used the multiple-choice technique. Skin- 
ner (27) em ployed a “composed’’ response, which 
does not involve the added cues and distractions 
that multiple-choice answers appear to provide. A 
third mode-of-response variation would be to de- 
mand no overt response at all by the S, following 
pilot studies by Evans, Glaser, and Homme (6). 

А second variable of interest involves the pos- 
sibility of formalizing the process of program con- 
struction. Several writers (11,19,28) have sug- 
gested various techniques and types of program 
steps ог “етв.” Such suggestions аге generally 
insufficient to instruct inexperienced personnel in 
program construction. A programming methodol- 
ogy developed concurrently with the present study 
(7,14) gives specific suggestions bothfor basic types 
of items and for techniques of assembling these 
items into а program. Since it is possible to con- 
struct a program in this way “according to form- 
ша,” such a program would lend itself much more 
easily to experimental additions, deletions, and re- 
orderings. А demonstration that formally gener- 
ated programs can compare favorably with programs 
produced by less specifiable techniques would rep- 
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resent a valuable step in program technology. 

Finally, most of the proponents of machine 
learning (e.g., 22, 27) have emphasized the impor- 
tance of the immediate feedback or confirmation of 
results which the machine provides. The necessity 
for such feedback for the most effective modifica- 
tion of many kinds of behavior is well-documented 
(5). As such, temporal delay of confirmation con- 
Stitutes another potentially relevant variable. Little 
(17) demonstrated the importance of the immediate 
feedback provided by a machine using a non-pro- 
grammed set of multiple-choice items. Perform- 
ance of groups which received immediate knowl- 
edge of results was better than that of groups whose 
responses were scored and returned the next day. 
Whether such immediate confirmation of results is 
critical in the program situation is an experimental 
question, 

The purpose of the present study, arising from 
the foregoing considerations, was three-fold: a) to 
explore the suitability of a task in symbolic logic as 
a topic to be presented with learning programs of 
the teaching machine type; b) to developa ‘‘standard 
learning program ,"' as well as reliable criterion 
measures of the material presented in the program, 
with features which wouldfacilitate further research 
in the area of programmed learning; and c) to in- 
vestigate the effects of variations in methods of re- 
Sponding, program construction, and immediacy of 
feedback on measures of rate of learning and on 
immediate and delayed performance measures. 


Materials and Procedures 
A Rationale for the Use ofa Task in Symbolic Logic 
MM а Сав Symbolic Logic 


A primary purpose of the present investigation 
was the development of a standard learning pro- 
gram which would provide a uniform laboratory task 

_for other studies in programmed learning. Moore 
and Anderson (20) have Suggested the use of a sym- 
bolic logic task drawn from that branch of logic 
known as the “‘calculus of propositions” for use in 
studies of human problem solving. Although the 
present study was primarily an investigation of 
learning rather than problem Solving, symbolic log- 
ic was selected for use because of its flexibility and 
other advantages. Among the featuresof the calcu- 
lus of propositions listed by Moore and Anderson 
which make such a calculus particularly appropri- 
ate for an investigation of programmed learning 
with college students are: no assumption of prev- 
ious training is needed, few students are likely to 
have experienced the subject, problems of any de- 
‘sired degree of complexity can be generated, pro- 
gram length can be altered by varying number of 
rules to be taught, a number of dependent variable 
measures are available, learning time is within 
practical limits, the task is interesting, detailed 
records of subjects’ behavior can bekept, and iso- 
morphic andformalrelationships betweenthe calcu- 


lus of propositions and other topics make possible a 
large number of studies in the area of transfer of 
training. 


The Logical Properties of the Material Presented 
by the Learning Programs 
ы А ыг гала ese АУ ыш ы 


The behavior whichthe learning programs used 
in the present study were designed to produce was; 
a) the ability to recognize and employ certain ele- 
mentary rules taken from the calculus of proposi- 
tions; and b) the ability to use them in proper com- 
binations to construct deductive proofs. Onan 
empirical basis, certain of the theorems and pos- 
tulates of the calculus of propositions have shown 
themselves to be particularly useful in the analysis 
of deductive proofs. Copi (3) lists 19 *«elemen- 
tary valid argument forms, ” and the 12 “rules” 
developed by Moore and Anderson (20) for problem 
solving overlap considerably with the set selected 
by Copi. The selection of a particular set of such 
theorems, rules, or argument forms is, of course, 
arbitrary, and is to be judged only by the suita- 
bility of the rules chosen for the task at hand. For 
this study a set of 15 rules was selected involving 
а total of four different logical Operations: nega- 
tion, conjunction, inclusive disjunction, and impli- 
cation. 

These operations were used on propositions 
symbolized by letters of the alphabet. Inthe learn- 
ing programs Ss were not instructed that they were 
participating in a task involving the calculus of 
propositions, or that, for example, “а + b” could 
be read “а implies b.’’ Rather the task was pre- 
sented as a purely formal one in which Ss had to 
learn to manipulate the symbols and letters given 
in accordance with the rules provided. This рго- 
cedure seemed warranted to reduce as much as 
possible transfer effects from verbal habits already 
present in Ss. 

The particular calculus which the learning pro- 
grams presented had the following features: 

1. Elements. These were represented by let- 
ters of the alphabet (е. g., a, о, К, etc.). 

2. Operators. These were four in number and 
included the symbol for negation (7), conjunction 
(A), inclusive disjunction (V), and implication (+). 

3. Rules. Examples of the 15 rules are pre- 
sented in Table I. 

4. Punctuation. This was provided for by a 
system of underlining to indicate the specific group- 
ing of associated elements. 

The calculus as presented in the programs was 
modified somewhat from conventional procedures. 
First, the operators of the system were given spe- 
cific names to avoid attaching any specific verbal 
associations to them. The symbol which negated a 
proposition (—) was called ‘‘bar,’’ and was used 
like this: a. The symbol for conjunction (A) was 
called “епі, °? for disjunction (V) was called 
“wedge, '* and for implication (+ ) was called spear. 


| 
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Second, the rules of the system were given novel 
names, again to control for differential verbal as- 
sociations by Ss. That is, to refer to the “Сош- 
mutative Law’’ might immediately identify the rule 
to Ss who had had algebra, but have little associa- 
tive value for Ss who had had no algebra or who had 
forgotten it. To control for this effect the rule was 
called the ‘‘Letter Exchange Rule.'' An effort was 
made in the selection of the names of the fifteen 
rules to select as non-technical and descriptive 
terms as possible. Third, a novel method of punc- 
tuation was adopted, again in the hope of simplify- 
ing the presentation. It seemed that punctuation 
using the parenthesis-brace-bracket (е. g.,8), mul- 
tiple parenthesis (e.g.,3), or multiple-square-dot 
(е. р., 30) systems were too difficult to present rap- 
idly and efficiently in a learning program. The 
method used in the programs employed underlining 
to indicate how particular elements were connected 
by the binary operators (conjunction, disjunction, 
implication). The scope of the singulary operator 
(negation) was indicated simply by extending the 
“bar” over the entire length of the expression to 
be negated. 

Since the primary purpose here wasto construct 
a “piece of apparatus” to be used to investigate 
programmed verbal learning, the completeness of 
the rules from the standpoint of symbolic logic, is 
irrelevant in a sense. It seemed desirable to in- 
corporate rules of as general utility as possible to 
extend the range of criterion behavior (construction 
of deductive proofs) which could be demanded. Ta- 
ble I presents examples of the selected set of rules 
and the names given these rules by Copi (3), Moore 
ànd Anderson (20), and the learning program. 


Construction of the Programs Used in This Study 


Two basic programs were constructed to teach 
deductive-proof behavior involving the 15 rules 
drawn from the calculus of propositions listed in 
Table I. The first basic program was developedon 
the basis of the principles of program construction 
available in the literature at the time of its con- 
Struction. This program is called the Initial Pro- 
gram. It was used for a series of four experiment- 
al variations to be reported. 

_ On the basis of concurrent pilot studies, cer- 
tain formal principles of program construction 
Were derived which appeared to facilitate the task 
of programming a subject matter as described by 
Evans, Glaser, and Homme (7). An additional 
Program was developed employing these formal 
Procedures, Тһе latter is called the Formal Pro- 
gram, and was used in two additional treatments. 

. Construction of the Initial Program. It is dif- 
ficult to specify the method of construction of the 
first program for the very good reason that at the 
time of its construction, little programming meth- 
Odology was available. Papers presenting sugges- 
tions for programming (11, 27, 29) were somewhat 


helpful in suggesting item types, but gave very little 
help in problems such as number of items, order- 
ing of items, or positioning of review items. А few 
sample programs were available (e.g.,27), and in- 
itial attempts at programming proceeded chiefly by 
analogy with these prototypes, following the ad- 
monition to ‘‘proceed by small steps. ’’ 

The actual construction of the Initial Program 
proceeded as follows: Each of the 15 logical rules 
was written on a separate index card. The rules 
were then informally ordered on the basis of sim- 
plicity of operation and number of symbols involved. 
The initial items of the program described the 
basic logical symbols and their rules of combina- 
tion. The first rule was then stated, one or more 
examples were worked, and then some sort of re- 
sponse was required of the S. Typical responses 
would be to work a new problem, complete an ex- 
ample, or to state some part of a rule. Subsequent 
items contained either new rules or examples, or 
review of rules already presented. Мо systematic 
review procedure was followed. АП 15 logical 
rules were presented by the forty-third item of the 
program. Subsequent items in the program gave 
examples of how several rules in succession could 
be used to change a set of given symbols into a 
“winning” or terminal position. Instructions were 
included for Ss to justify each step taken by giving 
the initials of the rule used and the step number or 
numbers to which it was applied. Items containing 
a number of problems with various combinations of 
the basic rules were constructed. А total of 72 
items made up the Initial Program. Following con- 
struction of the Initial Program, each of the 72 
items was typed on a separate 5 x 8 inch index card 
to form the rough program. On the back of each 
card was typed the correct response or responses 
for that item. \ 

A feature of programmed learning emphasized 
by Skinner (27) is the critical importance of allow- 
ing the behavior of the student to guide subsequent 
modifications of a given program. Items on which 
Ss make errors can be scrutinized in an attempt to 
determine the source of the error. Inthis way 
ambiguities can be cleared up, unclear typography 
changed, and additional examples and explanations 
added to facilitate inadequately strengthened be- 
havior. The process can then be repeated with ad- 
ditional Ss, and subsequent revisions made until 
the program produces reliable performance at some 
acceptable level. The version of the Initial Pro- 
gram used in the experiment proper represents a 
third major revision based on a careful analysis of 
the responses of 12 pilot Ss. Pilot work wasterm- 
inated when the Initial Program was producing over 
9095 correct responses to program items, and 
completion time on the program was falling within 
the desired two-hour interval. 

The Formal Program. Ав indicated previous- 
ly, the second basic experimental program, termed 
the Formal Program, was constructed in a much 
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more systematic and specifiable manner than the 
Initial Program. The rationale of the Ruleg System 
underlying the construction of this program is pre- 
sented elsewhere (7). In order to check whether a 
learning program written ‘‘by formula’’ could pro- 
duce results comparable to those produced by the 
Initial Program, the following procedure was em- 
ployed for each rule. Item one presented a) a verb- 
al description of the operations involved in apply- 
ing the rule; b) one or more examples of the rule; 
and c) an incomplete example for S to work. Item 
two gave an incomplete statement of the same rule 
and required 5 to give the name of the rule. Item 
three gave an incomplete example to which the rule 
must be applied. This completed the set for the 
first rule. The second logical rule was then dealt 
with by the same three-item procedure. Each of 
the 15 rules was subsequently presented in the 
same manner. 

Following this initial series on the rules, Ss 
were instructed, as in the Initial Program, that 
they were to transform certain given positions of 
symbols into a terminal or winning position by suc- 
cessive application of rules. Complete and incom- 
plete examples of this procedure were given. Ss 
then received a review series, consisting of two 
items per rule. On the first item an incom plete 
verbal description of the operation of the rule was 
given, and S had to supply the term which complet- 
ed the statement correctly. The following item 
presented an incomplete example of the same rule. 
This review procedure was repeated for all 15 
rules. 

The next series of items gave complete exam- 
ples of rules being used in pairs to get to the win- 
ning position. Each item presented problems in 
which the same two rules had to be used to reach 
the winning position. Тһе final series of items 
presented only the given and winning positions, with 
instructions to get to the winning position using any 
rules necessary. During this last series, no com- 
pleted examples or prompts of any sort were pres- 
ent to aid Ss in constructing proofs. The construc- 
tion of short deductive proofs withno external stim- 
ulus support was the principal criterion behavior 
which the programs were developed to produce. 


The Experimental Treatments 


With the programs developed, six experiment- 
al treatments were set up for study. Four treat- 
ments, using the Initial Program, investigated 
characteristics of the S’s response, and can gener- 
ally be classed as investigations of response mode. 
The last two experimental treatments, using the | 
Formal Program, were employed to study the effect 
of variation in technique of program construction, 
as well as the effect of a provision for review. The 
influence of these six treatments on measures of 
learning and retention constituted the major inter- 
est of this investigation. The following paragraphs 


describe the modifications of the two basic pro- 
grams which provided these variations. 

Response Composition (Treatment RC). Under 
this treatment, Ss were required to compose their 
answers to each item. This is the method of re- 
sponding recommended by Skinner (27). No answer 
of any sort was available on the front of the item, 
Ss were required to respond by supplying missing 
terms, working problems, and answering questions, 
In the following item from the Initial Program, the 
task of the S was to provide the two answers indi- 
cated by the blanks. 


Example of a Response Composition Item 


These three signs are called connectors: ү, ~, ^e 

Each connector has a special name to help us re- 
member it. 

This connector is called ‘wedge’: y. 

This connector is called ‘tent’: 


This is called ‘spear’; + . 


After the S had read the item, and had recorded the 
two answers which he consideredcorrect, he turned 
to the back of the index card which contained the 
item, and compared his response with the correct 
answers “А? and ‘connector.’ If his responses 
matched the correct answers on the back, he pro- 
ceeded to the next item. If he made an error, he 
was instructed to circle the error on his answer 
pad and determine why his answer was incorrect 
before proceeding to the next item. 

Multiple Choice Response (Treatment MC). In 
this treatment, the method of responding was that 
utilized by Pressey (22, 23) and involved selecting 
the correct answer from a number of alternatives. 
55 were given items identical with those presented 
in RC. However, at the bottom of each item was a 
set of answers lettered “А,” “В,” “С,” and so 
on. The task in this condition was to select the 
correct answers from this set by writing down the 
letters corresponding to the proper response. The 
following example shows a multiple-choice version 
of the item in the previous example. 


Example of a Multiple-Choice Item 


These three signs are called connectors: V, > ,^: 

Each connector has a special name to help us re- 
member it. 

This connector is called *wedge':v . 

This connector is called ‘tent’: 

This is called ‘spear’: > 

D: connector 


А: V B: A C: symbol 


Ss in this treatment followed basically the same 
procedure as Ss in Treatment RC, except that they 
chose their responses and recorded the corres- 
ponding letters. They then checked their responses 


, 
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against the letters representing the correct answers 
and proceeded as before. Where possible, the al- 
ternate incorrect answers were selected from er- 
rors made on the same items by pilot Ss. 

Implicit Response (Treatment IR). The major 
aspect of this treatment was that Ss did not make 
overt written responses to the items in the Initial 
Program. Pilot work reported by Evans, Glaser, 
and Homme (6) showed no significant differences in 
criterion performance between Ss who recorded 
their composed responses and Ss who confirmed 
their “implicit” responses, but did not record 
them. Тһе program for this treatment was con- 
structed by giving the correct answer or answers at 
the bottom of the item requiringthe response. Hence 
Ss had the correct responses available at all times 
as they studied the items. Their instructions were 
to study the card until they understood why the an- 
swer provided was correct in each case, and then 
to proceed to the next item. They were specifically 
instructed not to write down their answers in any 
form. An example of an implicit-response item is 
as follows: 


An Example of an Implicit Response Item 


These three signs are called connectors: V, > , A^. 

Each connector has a special name to help us re- 
member it. 

This connector is called ‘wedge’: V . 

This connector is called ‘tent’: 

This is called ‘spear’: > . 


Answer: A^ 
connector 


An alternative method would have been to insert the 
correct answers into context in the sentences and 
examples of each item. However, the technique of 
placing the correct answers at the bottom of the 
card was selected, in order to force attention to the 
Same portions of each item as did Treatments RC 
and MC. 

Immediate Feedback (Treatment IF). Most 
items in the Initial Program required morethanone 
response. Ss were not allowed to check their re- 
sponses under RC and MC until they had completed 
all responses to that particular item. This effec- 
tively delayed the feedback as to the correctness of 
individual responses. Even the last response to an 
item was not always confirmed immediately. Ss 
usually checked off their responses with those on 
the back of the card in order, and such a checking 
procedure delayed the confirmation of the last re- 
Sponse. This delay between response and response 
Confirmation under Treatments RC and MC aver- 
aged about two minutes, with a range of 30 seconds 
to five minutes, depending on item difficulty. To 


*The masking procedure is a modification of a technique used by 


Composition by a learning program. 


determine whether such delay of confirmation in- 
fluenced performance, the following procedure was 
devised. A numbered list of answers to all items 
in the Response Composition form of the Initial Pro- 
gram was prepared. This list, with space provid- 
ed for responses, was given to Ss alongwithasmall 
cardboard mask.* Ss were instructed to cover the 
answers with the mask until they had written the 
first response to an item. At that time they were 
to move the mask down until the correct answer 
appeared on the answer sheet. They checked this 
response, and then repeated the procedure for sub- 
sequent responses to that item. In this manner the 
confirmation of a response was given immediately, 
regardless of the total number of responses on that 
particular item. 

The four experimental treatments described 
above employed the Initial Program and four vari- 
ations in the mode in which Ss responded. In sum- 
mary, the conditions were: RC—Ss constructed 
their responses to each item in full. These re- 
sponses were then checked with the answers on the 
back of the card. MC—Ss used a choice response, 
selecting their answers from a set of multiple- 
choice answers appearing at the bottom of the item. 
IR—no overt written response was required. Тһе 
correct answers, separated from the context as in 
RC and MC, appeared at the bottom of the item for 
Stocheck. IF—a masking technique was used to 
reveal the correct answer immediately following 
each response to a response-composition item. 

Formal Program (Treatment FP). The essence 
of this treatment was the use of the Formal Pro- 
gram previously described, in comparison with the 
treatments using the Initial Program. А demon- 
stration that such a program was as effective in 
producing criterion behavior as previous methods 
would facilitate operational specifications of pro- 
gram construction. This treatment was adminis- 
tered exactly like Treatment RC. Responses to 
each item were composed in full and then checked 
against the answers on the back of the card. Incon- 
trast with the Initial Program, in which items typ- 
ically required more than one response, most 
items early in the Formal Program called for a 
single response. The following example illustrates 
a typical item of the Formal Program. The rule- 
example-incomplete example pattern usedthrough- 
out the program is evident in this item. 


Example of an Item from the Formal Program 


This sign is named *wedge': y . 

It is called a connector, since it connects any two 
letters when it appears between them. 

For example, we would write ‘‘m wedge г” like 
this: mV г. 

Now you write “К wedge t": 


Ferster and Sapon (9) in teaching German 
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The correct answer «КУ t” would appear on the 
back of the card on which this item appeared. 


Formal Program « Review Card (Treatment 
ЕР»). Programming procedures developed thus tar 
no provisions for ‘‘memory storage’’ of the 


materials presented by the program. Most pro- 
grams in existence do not have such a provision. 
That is, at the time an Sisgivena review item on 
materials previously covered, he typically is with- 
out any summary, outline, or abstract of such ma- 
terialto prompt his response. The possible rele- 
vance of such a factor was emphasized by responses 
to à questionnaire administered during the early 
phase of experimentation. Several Ss indicated 
that it would have been helpful to have some method 
of reviewing the rules presented in the program. 
The following prócedure was devised to check the 
effect of a provision for review on performance. 

А complete list of examples of all rules was 
prepared, and all such examples were typed on a 
single 9 x 12 inch card, Ss were instructed that 
this list of rules would be available for their use as 
they proceeded through the Formal Program, but 
that ít would not be available during the tests which 
followed the learning program. No other sugges- 
tions for the use of this review card were made. In 
all other respects the administration of this condi- 
tion, called Treatment ЕР», was the same as in 
Treatment FP. 


Construction of the Criterion Measures 


Results of an investigation of program versus 
textbook presentation of the same material (6) indi- 
cated that different experimental treatments may 
produce differential performance as a function of 
the criterion measure used. This finding, coupled 
with the exploratory nature of the present study, 
suggested that several types of criterion tests 
Should be constructed to sample various aspects of 
the behavior produced by the learning programs. 
One ‘‘level’’ of performance would involve making 
discriminations between correct and incorrect in- 
stances of applications of the logical rules. A true- 
false test was constructed for this purpose. Asec- 
ond type of behavior involves recall and application 
of each of these rules when the name of the rule is 
given. А third type of behavior consists of succes- 
sive applications of these rules іп combination to 
produce short deductive proofs. 

In addition to assessing the effects of experi- 
mental treatments on different types of criterion 
performance, it also seemed important to assess 
treatment effect on retention of the behavior learned 
inthe program. The systematic nature of the 
chosen task facilitated the construction of parallel 
retention tests for each of the criterion measures. 

The True-False Tests. The first criterion 
measure was a 15-item true-false test on each of 
the logical rules presented by the programs. A ta- 
ble of random numbers was used to determine 
whether a true or a false example of a particular 


` havior which the programs were primarily designed 


rule would be constructed. Ап example of each 
rule was then prepared in which the last step of the 
example either followed from the rule in question 
or contained some error. Тһе 15-item testsocon- | 
structed was designated “TF1. ” m 
A parallel, but not identical, retention test was 
prepared in the following manner. If a true ex- 
ample of a given rule had been given in TF1, a 
false example of that rule was presented in the r 
tention test, designated ** TF2. '' Also, false ex- 
amples in TF1 were replaced by true examples of 
those rules in TF2. In this way each 5 had to dis- 
criminate one true and one false example of each of 
the 15 rules in the course of the criterion testing. 
The Recall Tests. A second type of behavior of 
interest was the recall and application of each rule 
given the name of the rule and a step or steps on 
which to apply it. A 15-item test was constructed 
by calling for each of the rules in turn. This re- 
call test was designated **R1.'' By changing the 
particular symbols involved, a parallel, but not 
identical, test was constructed for retention pur- 
poses. This test was designated ‘‘R2.”’ г 
The Deductive-Proof Tests. The criterion be- 


to produce was that of using the logical rules in 
combinations to obtain deductive proofs, The fol- 
lowing systematic procedure was used to construct 
15 problems of this type. First, a 15 x 15 matrix 
was constructed with a list of the names of each 
rule forming the axes. Each of the 225 cells of the 
matrix then represented an ordered pair of rules. 
By selecting the necessary initial steps, and then 
applying two rules in succession, a terminal posi- 
tion deducible from the initial position was reached. 
To present this as a problem, the initial positions 
were given, along with the terminal or **winning' 
position. Each S then had to provide the intermed= 
iate steps which constituted the proof. 

Selection of the pairs of rules from the matrix 
was done as follows. Cells were chosen at random. 
However, a constraint was imposed to prevent se- 
lecting two cells from the same row or same col- 
umn, and cells along the major diagonal which rep- 
resented the intersection of each rule with itsélf. 
This procedure resulted in a set of 15 problems 
with the following properties: a) each rule was 
employed in two and only two different problems; 
and b) each rule appeared in the first position in 
one problem and in the second position in some 
other problem. А set of 15 rules so generated was 
designated *DP1." A parallel retention test, DP2, 
was constructed using the same matrix and proced- 
ure. The only additional constraint was that no 
cell be used to generate a problem which had al- 
ready been used іп DP1. 

The Attitude Questionnaire. А short question- 
naire was constructed to assess the reactions of 88 ( 
to the method of programmed presentation of ma- 
terialto be learned. Ss were asked to rate their 
interest in taking a course using programmed ma- - 
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terial, attitude toward the effectiveness of such 
procedures, and opinion of the amount of review 
Which the program provided. Other comments on 
the experiment itself as well as on programmed 
presentation were encouraged. 


Other Experimental Materials 


* 

Each S in Treatments RC, MC, FP, and ЕР» 
received a 3 х б inch answer pad on which to re- 
cord his answers. These Ss were instructed to turn 
to a new sheet on the answer pad when they turned 
to each new item іп the series. This procedure 
was adopted to prevent previous composed respon- 
ses from serving as prompts for asubsequent re- 
sponse to any particular item. The answer sheet 
and mask for Treatment IF have been described 
previously. Ss in Treatment IR, who were not re- 
quired to write downtheir answers, used a 5 x 8inch 
pad to record their rate of responding as described 
in the next section. 


Procedures and Subjects 
A RIeS апа ише 


АП Ss in the present study were University of 
Pittsburgh undergraduate students and were paid 
$4. 00 for completion of the experimental and reten- 
lion sessions. Ss were informed of the general na- 
ture of the experiment. On the average, three Ss 
Were run per experimental session. Ss were as- 
Signed randomly to the experimental treatments, 

At three-minute intervals following the start of the 
Session, E would say, ‘‘Write down your card num- 
bers, please.” This procedure provided a method 
for getting a rate-of-response record, i.e., num- 
ber of cards per three-minute interval. Duringthe 
learning Sessions, Ss were placed so that E could 
Observe easily all phases of their reading and re- 
sponding. This was done to control possible varia- 
tions from the experimental procedure such as 
looking at the back of the card before answering or 
moving the mask before responding under Treat- 
ment IF, Ss took two six-minute breaks outside the 
experimental room. The combination of different 
treatments and different working speeds on the part 
of Ss resulted in a natural staggering of break 
time, so that two Ss rarely had their break together. 

After the second break following completion of 
а program, Ss returned to the experimental room 
to take three performance tests over the pro- 
grammed material. The true-false test was given 
first followed by the recall test and then the deduc- 
tive proof test. Following the completion of each 
individual test, E recorded the time, removed that 
test, and brought the next test. This procedure 
Was adopted to control for Ss using the results on 
Опе test to prompt themselves on another test. Af- 
ter the third test, Ss were given the attitude ques- 
tionnaire. The retention tests were given one week 
later and were administered in the same manner as 
the post-program tests, Time scores on these 

_ tests were again recorded. " 


Results 
Analysis of Time Scores Е 


The following time scores were available for 
analysis: a) total time to complete the learning 
program; b) times spent on each of the three im- 
mediate performance tests; and с) times spent on 
each of the three parallel retention tests, 

Program Times. The total time in minutes 
which each E spent on the learning program was ге- 
corded. Table II presents the means and standard 
deviations of these scores for each of the Six ex- 
perimental treatments, Ss in Treatment IR (no . 
overt responses to the Initial Program) took appre- 
ciably less completion time (over 20 minutes) than 
Ss in the other treatments. Mean learning time for 
the two formal program treatments was practically 
identical. Тһе three Initial Program treatments 
requiring multiple-choice and composed responses 
(MC, IF, and RC) had mean times from about 15 to 
25 minutes more than the means of the two Formal 
Program groups. Considering the four Initial Pro- 
gram treatments together, it appears that requir- 
ing overt responses by Ss increases the mean learn- 
ing time from 40 to 50 minutes as compared with 
implicit responding. 

An analysis of variance of learning times indi- 
cated significant differences among treatment 
means (p <.01). Since the Initial Program and the 
Formal Program differed in method of construction 
and in the total number of items, two further anal- 
yses were made, An analysis of variance for the 
four Initial Program treatments considered sepa- 
rately showed significant differences between means 
(p 4.01), due chiefly to the distance of the mean 
of the implicit-response group from the means of 
the three overt-response groups. The two Formal 
Program groups (FP and FP+), had essentially the 
same learning time. 


Immediate and Retention Performance Test 
Times, Time scores were ауа е on eac the 
three performance tests administered immediately 
after the learning session. Analogous scores were 
available on the three parallel forms of these tests 
administered one week later. The means of the 
immediate and retention test times, separated by 
experimental treatments, are presented in Table 
Ш. Means of the sums of the three immediate test 
times, and of the three retention test times, are 
also presented. Finally, the means of thetotaltime 
taken on all immediate and retention tests summed 
together are given. 

Total Test— By treating the immediate and re- 
tention tests as separate trials, a repeated-meas- 
ures analysis of variance (4) could be performed on 
thetime scores. Such a design permitted three 
sources of variation to be tested for significance: 
a) differences due to experimental treatments; b) 
differences between trials (immediate performance 
versus retention); and c) interaction of treatments 
and trials. The first set of scores to be so ana- 
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TABLE І 
NAMES AND EXAMPLES OF FIFTEEN RULES IN SYMBOLIC LOGIC INCORPORATED IN 
THE LEARNING PROGRAMS 
Example Copi Moore and Anderson Program 
1. p 
2. руч Addition Addition Wedge Adding 
1. p 
2 q [ 
3. рла Conjunction Adjunction Tent Joining { ) 
Js prq 
2. р 
3. q Modus Ponens Modus Ponens Spear Separation 
1. pAg 
2: р Simplification Detachment А Tent Disconnecting 
1 p-q 
2 qr 
Hypothetical Transitivity of 
3. pr Syllogism  . Conditional Double Spear 
1 руа 
2 avp 
or 
1; p^d 
2. q^p Commutation Commutativity Letter Exchange 
1 pq 
2. qp Contraposition Contraposition Spear Exchange 
1: pvp Not Given Not Given Wedge Bar 


Continued.....-- 
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TABLE I (Continued) 


Example Copi Moore and Anderson Program 
Ls PAGAL 
2 PAQGAT 
or 
ds Pvayr 
2. рма Мг Association Association Line Shift 
rf P Vaar 
2. руалрмг 
or 
1, pAQavr 
2. р AQ VPAT Distribution Distribution Pairing Rule 
1 p 
2 p 
_or 
1 p 
2. p Double Negation Not Given Double Bar 
l ржа 
2. pva 
or 
1 P va t 
Definition of Definition of 
2 р-ға Material Implication Conditional Spear Wedge 
E 
1 р->а ы 
Y 
2. pag ^ 
or 
1 pag 
2 p>q Not Given Not Given Spear Tent 
1. pag > 
2. руа 
i or 
uo De Morgan's De Morgan's 
2. pag Theorem Laws Three Bar |... 
NENNEN ek o — À———7 wm. 2: женіп т La E R 
1. рур 
2 р Tautology T Iteration Wedge Repeat 
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TABLE II 


MEANS AND STANDARD DEVIATIONS, IN MINUTES, OF TOTAL LEARNING 
TIME SPENT ON PROGRAMS 
(Treatments have been ordered by mean time) 


Experimental Treatment Mean S. D. 
Implicit Response (IR) 81.8 16.9 
Formal Program (FP) 103. 6 22.1 
Formal Program + Review Card (FP+) 104.1 22.2 
Multiple Choice (MC) 121.6 23.1 
Immediate Feedback (IF) 127.1 19.6 
Response Composition (RC) 13271 17.4 
TABLE III 


MEAN TIME IN MINUTES ON EACH OF SIX PERFORMANCE TESTS FOR 
EACH OF SIX EXPERIMENTAL TREATMENTS 


M ————————————— 


Treatment 
Test RC MC IR IF ЕР ЕР» 
TF1 5.0 5.0 9.0 6.5 5.1 5.4 
TF2 4.2 4.3 4.4 4.5 5.3 4.4 
R1 5.6 6.2 7.5 5.6 4.7 4.5 
R2 4.4 4.3 3.9 5.0 4.2 3.6 
DPi 21.5 20.5 31.2 28.6 20.3 20.4 
DP2 21.2 19.9 19.8 24.2 18.3 20.3 
TF1 + R1 + DP1 32.1 31.7 47.7 40.7 30.1 30.3 
ТЕ? + R2 + DP2 29.8 28.5 28.1 33.9 27.8 28.3 
Total Time 61.9 60.2 75.8 74.6 57.9 58.6 
TABLE IV 


ANALYSIS OF VARIANCE OF PERFORMANCE TIMES UNDER SIX EXPERIMENTAL 
TREATMENTS USING TOTAL IMMEDIATE AND RETENTION TESTS 
AS SEPARATE TRIALS 


Mean 

Source df Square F P 
Between treatments 5 332. 65 2.40 < .05 
Between Ss іп same group 54 138. 35 

Total between Ss 1559 
Between trials 1 1092. 03 19.93 <.01 
Interaction: trials X treatments 5 236. 81 4.32 < .01 
Interaction: pooled Ss x trials 54 54. 79 

Total within Ss Ў 60 


Total 119 


“АЛА лыы. “айалы ааыа с жаа... — - е 
есе = — — — 


BEEN EIL. X ————————————Áue—— OO M TENER CMM 
lll „„—— ртр б 
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lyzed were the total immediate time scores (TF1 + 
R1 + DP1) and the total retention time scores (TF2 
+ R2 + DP2). Results of this analysis are present- 
ed in Table IV. All three sources of variation show 
significant effects, 

Considering the effects of the experimental 
treatments, it is interesting to note that the implic- 
it-response condition (IR), which was completed in 
markedly less learning time, had the highest mean 
time for completion on the three immediate per- 
formance tests. Such a difference disappeared 
over the retention interval; mean total times on the 
retention tests were similar for all experimental 
treatments. АП six treatments exhibited statisti- 
cally significant reductions in mean total test time 
over the retention interval. Тһе significant inter- 
action effect appears to be due chiefly to the fact 
that the mean total test time for Treatment IR 
dropped almost 20 minutes over the retention in- 
terval, while the meantimes for the other five 
treatments dropped only two to six minutes. 


True-False Tests— The same method of an- 
alysis was applied to the scores of the three types 
of performance tests considered separately. The 
results of the analysis of true-false scores were 
essentially the same as for the total time scores on 
the three tests together. All three sources of var- 
iation were again significant. The mean time for 
the implicit-response group agaom was consider- 
ably higher than that of the other treatments on the 
immediate post test. However, on the retention 
test all treatment means were similar. With one 
exception, all treatment mean times were less on 
the retention tests than on the immediate post tests 
and this effect was significant. 


Recall Tests—In the repeated-measures analy- 
Sis of time scores on the recall tests (R1 and R2), 
trial effects and trial х treatment interaction were 
again significant, but a treatment effect was absent 
inthis case. Examination of mean scores on the 
recall tests in Table III reveals a reduction inmean 
time on the retention test for all treatments, with 
Treatment IR again showing the highest immediate 
test time and the greatest drop over the retention 
interval. 

Deductive-Proof Tests— The results of a simi- 
lar analysis applied to time scores on the deduc- 
tive-proof tests (DP1 and DP2) showed that neither 
the treatment effect nor the trial x treatment inter- 
action effect was statistically significant. However, 
all treatments showed a significant drop in mean 
test time over the retention interval, as in the pre- 
vious analyses. The mean of the IR group was 
highest on the immediate performance test, as be- 
fore, and again was similar to the other treatments 
on the retention test. Despite differential reduc- 
tions in mean test times over the retention interval, 
the trial x treatment interaction effect was not sta- 
tistically significant. 

Since significant differences due to treatments 


were present in the analysis of mean total time 
(Table IV), and three of the four trial X treatment 
interactions so far considered were also signifi- 
cant, another analysis was performed considering 
each of the three immediate tests and each of the 
three retention tests separately. Results of a one- 
way analysis of variance on each of the six time 
Scores are presented in Table V. Significanttreat- 


TABLE V 
SUMMARY OF VARIANCE ANALYSES OF TIMES 


SPENT ON THREE IMMEDIATE AND THREE 
RETENTION PERFORMANCE TESTS 


Score F (for 5 and 54 df) P 
ТЕ1 4.50 <.01 
TF2 0. 74 NS 
Ri 1.56 NS 
R2 1.23 NS 
DP1 3. 36 «.05 
DP2 0. 88 NS 


ment effects were found on the immediate true- 
false and the immediate deductive-proof tests, but 
not on the recall tests. All such differences on 
time scores due to treatment effects had disap- 
peared by the time of the retention tests. 

In general, mean time scores showed consist- 
ent and significant reductions between the immedi- 
ate and the retention forms of the various criterion 
tests. Experimental treatments produced signifi- 
cant effects on the immediate true-false test times 
and the immediate deductive-proof test times, with 
Treatment IR consistently showing highest mean 
times. No significant treatment effects were found 
on the immediate recall test. 


Analysis of Error Scores 


Error scores, like time scores, were avail- 
able both on the responses made during the learn- 
ing program and responses made during the im- 
mediate and retention performance tests. 

Program Errors. Inalltreatments except the 
implicit-response (IR) condition, Ss recorded their 
responses to each item as they proceeded through 
the program. Such responses were Scored as cor- 
rector incorrect. A summary of these error 
Scores is presented in Table VI. It willbe recalled 
that the Initial Program consisted of 72 items, and 
the Formal Program consisted of 125 items. How- 
ever, the total number of responses was consid- 
ered a better basis for comparison than the total 
number of items. The Initial Program called for 
189 responses; the Formal Program called for 151 
responses. These figures were used to compute 
«‹регсепі errors рег response" in Table VI. 

Inspectionof Table VI reveals that almost twice 
as many errors were made under TreatmentRC as . 
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TABLE VI 


MEAN ERRORS AND MEAN PERCENT ERRORS PER RESPONSE ON THE LEARNING 
PROGRAMS FOR FIVE EXPERIMENTAL TREATMENTS? 


Treatment 
Initial Program? Formal Program 
Mean RC MC IF FP ЕР» 
Errors 34.3 18.1 30.7 25.9 17.1 
Percent errors per response 18.1 9.5 16.2 17.1 11.3 


аЕггогѕ under Treatment IR are not available since Ss did not record their responses. 


b189 responses in 72 items. 
€151 responses in 125 items. 


TABLE VII 


MEAN ERROR SCORES ON EACH OF SIX PERFORMANCE TESTS FOR EACH 
OF THE SIX EXPERIMENTAL TREATMENTS 


—————————————————— E — — M 


Treatment 
Score RC MC IR IF FP ЕР» 
TF1 5.4 5.8 6.5 6.2 6.0 5.2 
TF2 3.4 3.7 3.7 3.7 3.8 4.9 
R1 3.2 4.9 3.9 4.1 4.6 4.0 
R2 4.2 5.6 4.0 4.8 6.2 3.9 
DP1 8.1 9.9 7.8 1.9. 7.5 6.5 
DP2 iti t 9.6 ШЕЛ, 7.8 8.2 7.1 
TF1 + R1 + DP1 16.7 20.6 18.2 18.2 18.1 15.7 
TF2 + R2 + DP2 15.3 18.9 15.4 16.3 18.2 15.9 
Total Errors 32.0 39.5 33.6 34.5 36.3 31. 6 


were madeunder Treatment MC. This appears to be 
in line with a common finding that is more difficult 
to construct a response correctly than it is to rec- 
ognize such a response (18). It does not appear 
that providing immediate contirmation of individual 
responses (Treatment IF) reduces to any marked 
degree the total learning errors as compared with 
а delayed-confirmation treatment (RC). 

For the two Formal Program treatments (FP 
and FP+), provision of a **memory storage” device 
in the form of a review card appears to decrease 
learning errors somewhat. This seems reasonable 
since such a review card should provide Ss with an 
additional prompt to increase the probability of 
correct responding to review items in the program. 

Treatment RC and Treatment FP both involved 
the same sort of response composition, although 
each employed a different form of the experimental 


program. Percent errors per response were ap- 
proximately the same for the two treatments. It 
appears that construction of programs by formal 
techniques (Formal Program) produc es approxi- 
mately the same percentage of learning errors as 
produced by the Initial Program. 

Immediate and Retention Error Scores. Each 
of the three immediate performance tests and each 
of the three parallel retention tests contained 15 
items, making a total of 90 items for each 5. Up- 
on completion of the experiment, each of the items 
was scored, and the errors on each test were to- 
taled and used as the index of performance on that 
test. Table VII presents the mean error scores 
for each of the six treatment groups on each of the 
six performance tests. Means of the total number 
of errors for the three immediate and three reten- 
tion tests are also presented, as well as meantotal 
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TABLE VIII 


ANALYSIS OF VARIANCE OF ERROR SCORES UNDER SIX EXPERIMENTAL 
TREATMENTS USING TOTAL IMMEDIATE AND TOTAL RETENTION 
TESTS AS SEPARATE TRIALS 


Mean 

Source df Square F P 
Between treatments 5 43.67 --- NS 
Between 56 in same group 54 112. 92 

Total between Ss 2159 
Between trials 1 46. 88 7.51 E 
Interaction: trials x treatments 5 6.97 1.11 NS 
Interaction: pooled Ss x trials 54 6.24 

Total within Ss 60 

Total 119 

TABLE IX 


CHI-SQUARE ANALYSIS OF THE RELATIONSHIP BETWEEN EXPERIMENTAL TREATMENT AND 
ATTITUDE TOWARD TAKING A COURSE USING PROGRAMMED LEARNING 


Treatment 
Attitude RC MC IR IF FP ЕР» Total 
Like 4 4 8 5 6 T 34 
Indifferent 2 3 1 1 0 1 8 
Dislike 4 3 1 4 4 2 18 
Total 10 10 10 10 10 10 60 


Chi-square - 2.99; this value is not significant for 10 degrees of freedom. 


error for all six performance tests combined. 

The method of analysis of error scores pro- 
ceeded in the same manner as the analysis of time 
Scores. The immediate and retention tests were 
treated as separatetrials, andarepeated-measures 
analysis of variance was made for both total scores 
and scores made on the three types of performance 
tests separately. 

Table VIII presents results of the analysis of 
total immediate and retention error scores. T he 


only statistically significant effect was due totrials. 


It is of interest that the total number of errors 
made after a one-week retention interval (1000 er- 
гогв) was less than the total number of errors 
made immediately after the learning session (1075 


errors). With respect to other effects, it appears 
that experimental treatments had little effect oner- 
ror score, either as a primary source of variation 
or in combination with trials as a trial x treatment 
interaction. 

True-False Test—Further analyses were made 
of error scores by separating the true-false, re- 
call, and deductive-proof scores. Analysis of er- 
ror scores from the true-false test (TF1 and TF2) 
gave results essentially the same as in the previ- 
ous analysis. Again the only significant main effect 
was that attributable to trials. As before, the num- 
ber of errors made after the retention interval (232) 
was significantly less than the number made on the 
immediate test (351). Effects due to treatments 
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and interaction effects were negligible. 

Recall Test—In the analysis of the recall tests 
(R1 and R2) the only statistically significant source 
of variation was that due to trials. Here, however, 
the previous finding was reversed. Significantly 
more errors (287) were made after the retention 
interval than at the time of the immediate perform- 
ance test (247). This effect was opposite in direc- 
tion from that of the true-false and total scores. 

Deductive-Proof Test— The last analysis was 
that of the deductive-proof scores (DP1 and DP2). 
No significant differences due to any of the testable 
sources of variation were found. Treatment and 
trial х treatment effects were absent and moreover, 
no effect due to trials was found. Тһе total number 
of immediate errors (477) on the deductive-proof 
tests was virtually the same as the total number 
made on the retention test (481). 

In summary, the three types of performance 
tests yielded different results over the retention in- 
terval. The true-false test showed significantly 
fewer errors after one week; the recall test showed 
significantly more errors and the deductive-proof 
test showed no significant change in error score. 
In no case were significant effects due to experi- 
mental treatments or due to an interaction between 
trials and treatments found for error scores. 


Properties of the Criterion Measures 


One of the purposes of the present study was to 
develop a satisfactory set of criterion measures of 
behavior learned in the programs. Such measures 
Should be reliable and should discriminate between 
different levels of performance. The results of anal- 
yses of such properties are summarized here and 
presented in detail in Evans' dissertation, Univer- 
sity of Pittsburgh, 1960. 

Reliability. Split-half and test-retest reliabil- 
ity coefficients for total immediate and total reten- 
tion error scores were computed. Reliabilities 
were computed for each experimental treatment 
separately, yielding a total of 12 split-half reliabil- 
ities for total test scores, one for each immediate 
and retention test. Nine of the 12 split-half relia- 
bilities coefficients were 0. 92 or above, and all 12 
were 0. 84 or above. Thus the reliability of boththe 
immediate and retention measures developed inthis 
study are quite satisfactory for experimental pur- 

poses. Of the six test-retest coefficients, one was 
0. 66 while the remaining reliabilities ranged from 
0.88 to 0.98. Such findings support the position 
that immediate and retention tests can be treatedas 
separate trials of the same task, andjustify repeat- 
ed-measures analysis. 


Subject Characteristics and Performances 


In addition to time and performance scores, 
several other measures of a more qualitative na- 
ture were available on each S for analysis. Analy- 
ses of the effect of sex, mathematical experience, 


and college class on performance were made. 

Sex. Inall, 27 male and 33 female Ss partici- 
pated in the present study. То investigate the pos- 
sible relevance of the sex variable on performance, 
each S's total error score was taken as the overall 
index of test performance. Scores were then di- 
vided into two groups on the basis of sex. A one- 
way analysis of variance indicated no significant 
difference in error scores attributable to sex. 

Mathematical Experience. Records were avail- 
able on the number of high-school and college math- 
ematics courses which each S had taken. The total 
number of courses in mathematics which each S had 
taken was used as an index of ‘‘mathematical ex- 
perience.’’ These scores in turn were correlated 
with total error scores on all six performance 
tests. The overall correlation coefficient between 
mathematical experience and total error score was 
-0.11. Apparently little relationship exists be- 
tween performance measures and this index of 
mathematical experience. 

College Class. An attempt was made in re- 
cruiting to take only freshman and sophomore 55. 
However, scheduling difficulties necessitated the 
inclusion of several juniors, seniors, and special- 
class students. To check the possibility that col- 
lege experience per se might be relevant to per- 
formance, a chi-square analysis of total error 
Scores was made. Ss were divided at the median 
error score, and then cross-classified as to col- 
lege class. Тһе obtained chi-square value was not 
significant. 


The Attitude Questionnaire 


А questionnaire was administered after the 
learning programs to check the possibility that the 
experimental treatments had differential effects on 
the attitudes of Ss toward certain aspects of this 
technique of learning. Responses made to three 
items on the questionnaire were analyzed. The first 
item sampled the attitude of Ss toward taking a 
course which would employ a learning program. 
The ‘‘definitely like” and the ‘‘like somewhat’’ cat- 
egories, as well as the **dislike somewhat" and 
“definitely dislike” categories were combined to 
insure adequate expected values for the cells in 
Table IX. The obtained chi-square value was not 
significant. A similar chi-square analysis was 
performed to see if any relationship was present 
between experimental treatments and ratings by Ss 
as to whether they thought they could have learned 
“better, the same, or not as well’’ by more con- 
ventional methods of presentation. А chi-square 
value was not significant. In order to assess the 
attitude of the students toward the amount of review 
іп the program, they were asked to judge whether 
the amount of review was “іоо much, about right, 
or too little." Again, the obtained chi-square val- 
ue was not significant. 
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Discussion 


Implications of the Specific Findings 


One very general statement can be made con- 
cerning the six experimental treatments selected 
for the present study. With respect to statistical 
significances obtained, the effect of the experiment- 
al treatments was on time scores but not on error 
scores. Treatment effects on time scores were 
present both in learning time and in time spent on 
criterion performance measures. Treatment ef- 
fects on error scores, however, were in no case 
significant. Even though all treatments produced 
essentially the same criterion performance, they 
led to differential effects on learningtime. Ss mak- 
ing no written responses to the Initial Program 
(Treatment IR) finished in about 20 minutes less 
time, on the average, than did the two groups who 
composed their response to the Formal Program 
(FP and FP+). Mean completion times for these two 
groups were practically the same. The three re- 
maining treatments (MC, IF, and RC), all requir- 
ing a written response to the Initial Program, yield- 
ed learning times from 15 to 25 minutes longer than 
the two Formal Program treatments. However, 
the number of individual responses called for by the 
Initial Program (189) was greater than inthe Form- 
al Program (151). 

It appears that with implicit responding, Ss сап 
complete their programs in about 65 percent of the 
time taken by Ss who must record some composi- 
tion or multiple-choice response. This finding is 
consistent with results obtained by Evans, Glaser 
and Homme (6) using a program which presented 
fundamentals of music. 

With respect to the two Formal Program treat- 
ments (FP and FP+), availability of a review card 
apparently had almost no effect on mean time to 
complete the program. Informal observations by E 
during experimental sessions revealed frequent use 
of the review card by Ss. In addition, eight of the 
ten Ss under Treatment FP» indicated that they 
found the card “уегу helpful’ or **extremely help- 
ful." The other two Ss were observed to make use 
of the card, but they did not record a comment on 
its usefulness. Since the FP+ group did spend time 
using the review card, and yet took about the same 
total time as did the FP group, the inference canbe 
made that the review card must have increased to 
Some degree the rate of responding on the program 
itself. 

Additional treatment effects were in evidence 
оп the time scores for the performance tests. Ex- 
amination of these time scores reveals two facts: 
а) the implicit-response group, who took from 20 
to 50 minutes less learning time than the other 
treatments, consistently took more time on each of 
the three immediate performance tests; and b) all 
Such differences in performance times disappeared 
inthe retention tests. All treatments showed con- 


sistent drops in completion time for all tests over 

the retention interval, and such drops were statis- 
tically significant. The reduction in mean time for 
the IR groups was always greater than the corres- 

ponding reductions for the other groups. Such dif- 
ferences in the magnitude of the drop appear to ac- 
count for the observed trial x treatment interaction 
effects. 

Considering time and error scores together, 
the following picture of the effects of the six exper- 
imental treatments in the present study emerges. 
By allowing Ss to respond implicitly to program 
items which presented correct responses at the 
bottom of such items (Treatment IR), marked re- 
ductions in learning times were obtained with no 
significant decrement in immediate or retention 
performance as indicated by error scores. Ss ар- 
parently took time to *^warm ир” to overt respond- 
ing on the performance tests, as reflected in long- 
er immediate performance times for this group. In 
any event, such increased times to complete the 
performance measures were not accompanied by 
increased errors, and the treatment effect on time 
had disappeared by the retention measure one week 
later. 

Combined time and error results on the two 
Formal Program treatments (FP and FP») indicate 
that treatments involving overt responding to a 
formally-constructed program can produce, in less 
learning time, criterion performance comparable 
to that produced by treatments involving overt re- 
sponses to a less systematic program. Provision 
of a “memory storage" device for review purposes 
during the learning session reduced both learning 
errors and criterion errors, but the reduction in 
the latter was not significant. The presence of а 
review device during learning apparently had no 
systematic effect on performance times. 

Ss making multiple-choice responses to pro- 
gram items (Treatment MC) had somewhat shorter 
learning times and somewhat more performance 
errors than Ss who composed their answers in full 
to the same program. 

Finally, provisions for ‘immediate feedback" 
following composed responses to the Initial Pro- 
gram (Treatment IF) appeared to have little effect 
on learning time and learning errors, as compared 
with a treatment which delayed such feedback from 
about 30 seconds to five minutes (Treatment RC). 
The group receiving immediate confirmation of re- 
sponses made slightly more errors and took more 
time on the performance measures than the group 
whose confirmation of responses was delayed until 
completion of an item. 

A satisfactory discussion of the obtained find- 
ing must account for the observed differences in 
time scores and the absence of such differences in 
error scores, as well as the differential retention 
results found for the different types of perform- 
ance measures. Differences in time scores due to 
treatments will be presented first. With respect 
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to time scores, the most pronounced effects were 
associated with Treatment IR. It is not surprising 
that it takes longer to compose a response, record 
it, and then check it than it does to compose the ге- 
sponse and to check it without recording. Also, on 
more difficult items, Ss responding implicitly could 
prompt themselves with the correct answer imme- 
diately, while Ss responding overtly had to produce 
and record their response in the absence of such a 
prompt. During criterion tests, Ss who had been 
responding overtly continued such overt behavior, 
while implicit-responding Ss were writing out sym- 
bols, rules, and proofs for the first time. The 
lack of overt practice may have delayed times on 
immediate tests. Apparently the practice session 
with the immediate tests was adequate to bring the 
rate of overt responding up to that of the other 
groups during retention tests. 

Considering only the five overt-responding 
treatments, the two Formal Program groups took 
less learning time, but also had fewer total re- 
Sponses to make than the three remaining Initial 
Program treatments. Without pressing the prob- 
lem of the size of a verbal response unit too far, it 
can be stated that all five overt-response treat- 
ments averaged very close to 1.5 responses per 
minute, compared with 2. 3 responses per minute 
for the implicit-response group. In this light, the 
Formal Program groups appear to have been re- 
sponding at approximately the same overall rate as 
the Initial Program groups during the learning 
phase. Performance times for these treatments 
are quite similar for all six performance tests, 
with the possible exception of Treatment IF. On the 
immediate true-false and deductive-proof tests, Ss 
in this group took somewhat more time than Ss in 
the four other overt-response groups. 

Two premises of modern learning theory are 
that organisms learn by doing, and that organisms 
learn best when correct responses are followed by 
immediate confirmation or feedback (5). In the 
present study, Ss learned an overt task while re- 
sponding implicitly, and delays in confirmation up 
to five minutes had little effect oncriterion perform- 
ance. These anomalous results may be due to a 
fundamental difference between programmed and 
non-programmed approaches to the study of verbal 
learning. Classical techniques for the investiga- 
tion of verbal learning are characterized by the 
precautions taken to prevent learning from occur- 
ring. Nonsense-syllable lists are standardized for 
low-association value; concept-formation problems 
contain irrelevant stimulus dimensions; problem- 
solving tasks are selected for their novelty. Asa 

consequence, a characteristic of the initial stages 
of such learning is the number of response errors. 
For any increase in the probability of correct re- 
sponding to occur, differential feedback as to the 
adequacy of such responses is obviously necessary. 
In a learning program, however, the attempt is 
made to arrange a series of stimuli so that succes- 


sive responses have a high probability of being cor- 
rect from the beginning. As the learning progres- 
ses, the supporting stimuli or prompts are ‘‘faded’’ 
or withdrawn, but only at such a rate that correct 
responses continue to be emitted. At the termina- 
tion of an **ideal" program, criterion responses 
should be under the control of the minimum set of 
stimuli which set the occasion for such responses. 

If it is true that the stimulus portion of each 
item sets up a high probability that S will emit a 
correct verbal response, the problem of channel- 
ing such a response into any number of modalities 
is almost trivial. That is, if an S is adequately 
prepared to emit a verbal response such as “Тіп- 
coln, ” the correlation of responses will be almost 
perfect whether he is required to write it, type it, 
say it aloud, recognize it from a list, or write it 
inthe sand with his toe. In the same vein, imme- 
diate confirmation of such a response should cease 
to be a critical factor, since S has, as it were, al- 
ready confirmed the correctness of sucharesponse 
himself. Evidence in support of this latter point is 
indicated by observations that Ss did not always 
turn to the back of items to ascertain the correct- 
ness of certain responses. 

Thus, the relevance of variables such as re- 
sponse mode and immediacy of confirmation may 
be inversely related to the probability of correct 
responding. That is, in situations in whichcorrect 
responses have low probability, factors such as 
overt responding and immediate feedback are more 
critical than in situations in which probabilities of 
correct responding are high. Тһе absence of sig- 
nificant effects on error scores of the four ‘‘mode- 
of-response" treatments (RC, МС, IR, and IF) in 
the present study is clearly in line with this hy- 
pothesis. Also in line are the results of the earlier 
study of overt versus implicit responding. Such а 
hypothesis would also account for the failure of 
Silberman and Coulson (26) to obtain significant 
differences between composition and multiple- 
choice responding to an elementary psychology 
program. For non-programmed situations such as 
serial, paired-associate, or multiple-choice learn- 
ing, in which initial correct-response probabilities 
are low, the relevance of factors such as immedi- 
acy of feedback would be predicted. Results ob- 
tained by Saltzman (25) and Little (17), confirm the 
necessity for immediate feedback while learning 
non-programmed material. 

The differential results obtained on the three 
performance tests after the retention interval will 
now be considered. With respect to the true-false 
tests, it should be pointed out that not until the Ss 
reached the test itself were they required to clas- 
sify examples as being correct or incorrect. This 
proved to be a difficult task as evidenced by the 
large number of errors made on this test. Some 
Ss even scored below chance level on this 15-item 
examination. It is conceivable, however, that 
practice on this type of test is good preparation for 
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future tests of the same type. This may account 
for the sizeable drop over the retention interval in 
the number of total errors (351 to 232). The poor 
immediate performance on the true-false test may 
imply that to produce effective behavior on exams 
of the true-false and multiple-choice types, the 
program itself must provide specific practice on 
such items. Тһе second test in the series required 
Ss to recall a rule when given its name, and apply 
it to one or more given steps. On the retention 
tests the total number of errors increased from 247 
to 287, or an average of about 0. 67 errors per S. 
On the deductive-proof test, the increase in total 
errors after the retention interval was negligible 
(477 to 481). Like the recall test, the deductive- 
proof test involved recall and application of the log- 
icalrules. Apparently Ss sometimes remembered 
the operations involved in a rule, but had difficulty 
in recalling the name for that operation. This would 
account for the slight but significant increase in 
errors when the operations had to be recalled, giv- 
en only the name, as in the recalltest. The fact 
that ability to perform the operations involved in a 
rule suffered no particular loss over the retention 
interval is reflected in the similarity between im- 
mediate and retention performance onthe deductive- 
proof test. 

Considering the retention data together, it ap- 
pears that there is no marked decrement in per- 
formance over the retention interval, even though 
in the recall test an average increase of less than 
one error per S proved to be statistically signifi- 
cant. Тһе decrease іп true-false error scores 
over the interval is perhaps attributabletothe prac- 
tice effect received in taking the immediate true- 
false test, a type of performance not practiced in 
the program itself. Absence of any pronounced in- 
crease in the number of retention errors indicates 
that the behavior produced by the learning programs 
Was present in approximately the same strength 
after the one-week interval. If this was the case, 
then the facilitating effect of having taken three 
very similar immediate tests should have reduced 
the completion times on the retention tests. The 
finding of consistent and significant reductions in 
completion times over the retention interval is evi- 
dence for this conclusion. 

In summary, the chief implication of the re- 
Sults of the present study for the area of verbal 
learning is as follows: Failure to obtain perform- 
ance decrements attributable to variables such as 
non-overt responding and delay of feedback neces- 
Sitated a re-examination of the nature of such vari- 
ables in programmed learning. А distinction was 
made between situations in which probabilities of 
correct response were high throughout the learning 
period and situations with low initial probabilities 
of correct responding (e.g., nonsense-syllable 
lists, concept formation). It was hypothesized that 
the relevance of variables such as response mode 
апа immediacy of confirmation was inversely re- 


lated to the probability of correct responding. 


Program Technology 


Among the implications of the present investi- 
gation for the area of ‘‘program technology’’ was 
the demonstration that specifiable sequences of 
“¢standard’’ item-types (e. g., rule-example-incom- 
plete-example) can produce, in less learning time, 
the same level of criterion performance as pro- 
grams constructed according to less formal prin- 
ciples and procedures. Such results, if confirmed 
by other studies in other topics, might lead, in the 
long run, to procedures for programming knowl- 
edge for humans which are as rigorous and system- 
atic as programming procedures for digital com- 
puters. 

More immediately, the ease with which items 
can be added to the Formal Program permits ex- 
panding it to improve criterion performance. Of 
the programming variables reported in the litera- 
ture, the ‘‘size-of-step’’ variable has been most 
consistently related to improving performance (7, 
26). Since the two Formal Program groups (FP 
and FP+) had learning times from 20 to 30 minutes 
less than the three overt-responding Initial Pro- 
gram groups (MC, IF, and RC), such saved time 
could be used to present additional items. 

Another procedure for increasing the number 
of items in a program is associated withthe finding 
of significantly less time for the implicit-response 
treatment (IR). Again, since savings in time can 
be accomplished without performance decrement 
by allowing implicit responses, programs could be 
expanded so that total learning time is approximate- 
ly the same as for overt-response treatments. For 
the present task, a combination of an expanded 
Formal Program with implicit responding should 
permit learning time to be reduced by one-half as 
compared with overt responding to the Initial Pro- 
gram. Under these circumstances, a large num- 
ber of additional items or steps could be added 
without increasing the average learning time. 

A disadvantage of allowing implicit responding 
for programming research is that S leaves no rec- 
ord of his responses. As Skinner (27) has pointed 
out, a salient feature of learning programs is the 
progressive modification and improvement follow- 
ing analyses of recorded responses. In the devel- 
opmental phase of a program, the necessity for 
recording S’s responses still remains. After a 
program is producing satisfactory criterion behav- 
ior, it may be desirable to require recorded re- 
sponses to some items but to permit im plicit re- 
sponding to the remaining items. 1f such a pro- 
cedure produced no performance decrements, ap- 
preciable savings in completion times might be 
gained without loss of assurance that S is respond- 
ing correctly. In addition, the reduced number of 
recorded responses would consume less space and 
perhaps permit simplification of the device which 
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receives such recorded responses. 

Another result that has possible implications 
for program technology comes from a comparison 
of response-composition (RC) and multiple-choice 
(MC) responding to the Initial Program. Total per- 
formance errors under Treatment MC were 25 
percent greater than total performance errors un- 
der Treatment RC. Because of large within-group 
variability such differences were not statistically 
significant, but a mean difference of 7. 5 errors per 
S between the treatments deserves some comment. 
Such a difference favoring composed or constructed 
responses would be predicted by Skinner (27), who 
states that incorrect multiple-choice answers on an 
item may compete with the correct response. How- 
ever, if the hypothesis relating correct-response 
probabilities and mode of response holds, it would 
follow that as the probabilities of correct respond- 
ing on а program increase as the program is suc- 
cessively improved, response-mode differences 
Should decline. 

A brief comment is also in order as to the ef- 
fect of the ‘‘memory storage” device used in Treat- 
ment FP+. This treatment, involving the Formal 
Program plus the use of a review card containing 
the logical rules, produced the fewest total per- 
formance errors of the six treatments studied. It 
is interesting to note that Ss on the same program 
without the card (FP) made 15 percent more total 
performance errors, and about 50 percent more 
learning errors than the FP+ group. Althoughthese 
differences were not statistically significant, the 
fact that provision of a summary type prompting 
device reduced both learning and performance er- 
rors deserves further study. 

There are several interesting implications of 
the present investigation. Since delay of confirma- 
tion from 30 seconds to five minutes resulted in no 
significant performance decrement, what would 
happen if such confirmation were withheld altogeth- 
er? If the previous analysis of situations involving 
high correct-response probabilities is correct, it 
is possible that satisfactory performance can be 
attained without providing confirmation at all. A 
possible complication here is that such external 
confirmation may not be necessary for learning per 
se but may be necessary to ‘‘motivate’’ such be- | 
havior over the course of a program. That is, the 
primary function of the confirming stimulus may 
not be the strengthening of the response just emit- 
ted. Rather such stimuli may serve to maintain 
such responding until the program is completed. 

Another related variation involves the proced- 
ure used in the implicit- response treatment (IR). It 
will be recalled that the correct answers to each 
item were removed from the context of the item and 
placed at the bottom of that item. An alternative 
procedure would be to leave such answers and so- 
lution in context. Ss in such a treatment would 
presumably make their own implicit responses to 
items in much the same manner as they would when 


studying from a text. Results on performance 
measures would indicate whether the ‘‘blanks’’ and 
spaces which signal responses in а program are 
necessary to direct the attention of Ss to the criti- 
cal aspects of that item. 

Another question can be raised with respect to 
implicit responding. Much verbal behavior is so 
complex that Ss appear to need the stimulus support 
provided by their own recorded responses to com- 
plete a response correctly. Examples of this would 
be drawing a complicated electrical circuit or 
Sketching a complex organic compound. Whenunits 
of behavior of this size constitute the criterion be- 
havior, overt responding during thelearning phases 
may be necessary. The ease with which complex 
problems can be generated for the logic task used 
in the present investigation provides a technique by 
which the relation between response complexity and 
implicit responding could be studied. 


Summary and Conclusions 


Programs designed to teach the construction of 
short deductive proofs in symbolic logic were ad- 
ministered to six independent groups of ten college 
students each. Two experimental treatments in- 
volved a systematic program in which both the type 
and sequence of items followed the same pattern 
for each logical rule. Both groups using this pro- 
gram composed or constructed their answers to 
each item. One group did, and one group did not, 
use a review card containing all the logical rules. 
The remaining four treatments used a less system- 
atic program previously developed. Four different 
modes of responding to the items of the program 
were used. One group wrote out their responsesto 
items in the program. А second group also com- 
posed their answers, but received immediate 
knowledge of results on items involving more than 
one response. А third group had the correct an- 
swer present on the front of the item and were not 
required to make written response. A fourth group 
selected the correct response from a set of multi- 
ple-choice answers at the bottom of the item. 

A true-false test, a test involving recall of 
each of the rules, and a test requiring short deduc- 
tive proofs were constructed to sample different 
aspects of the behavior learned. These tests were 
administered after the experimental learning se- 
quence, and three parallel retention tests were 
given after a period of one week. Ап attitude ques- 
tionnaire toward the experiment was also adminis- 
tered after the learning session. 

Dependent measures were: time spent on the 
learning programs, time spentonthe performance 
tests, and number of errors made on the perform- 
ance tests. А 

The following conclusions are drawn on the 
basis of analysis of the data obtained: 

1. Experimental variations in mode of respond- 
ing significantly affect learning time. Ss not re- 
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quired to make an overt written response to each 
item can complete a learning program in about 65 
percent of the time required for composed or mul- 
tiple-choice responding. 

2. Criterion performance in terms of error 
Scores is not significantly affected by mode of re- 
sponding, including no overt responding at all. 

3. Systematically constructed programs can 
produce, in less learning time, criterion perform- 
ance comparable with that of a less systematic pro- 
gram. 

4. Ss who respond covertly to learning pro- 
grams take significantly more time on performance 
tests which immediately follow the program than do 
Ss who make their responses overtly. Such differ- 
ences in test time disappear after a retention per- 
iod of one week. 

5. Differential retention effects were observed 
as a function of the type of criterion performance 
measured. Error scores on true-false tests de- 
creased significantly; error scores on recall tests 
showed slight but significant increases; on tests 
involving deductive proofs no significant changes 
were observed. 

6. No significant relationships were observed 
between performance following the progam m ed 
learning sequence employed, and sex, mathemati- 
cal experience, or college class. 

7. Implications of the results for the area of 
verbal learning were discussed. It was hypothe- 
sized that the relevance of variables such as re- 
Sponse mode and immediacy of feedback are in- 
versely related to the probability of correct re- 
Sponding in the course of learning. 
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CONCERNING THE 


NEEDED NEXT STEP IN AUTO-INSTRUCTION 


On the basis of laboratory research involving tasks and situations very 
different from those іп the schools, publishers and manufacturers have raced 
into production of programs and devices for use there. The much overdue 
next step should be to investigate very critically whether, with meaningful 
materials in school situations, constructed responses really are morelearn- 
ing-producing in a given time than objective items, many very easy items 
more effective than fewer difficulty-resolving ones, a series of many sepa- 
rate ‘‘frames’’ better than continuing discourse. 


1f current indications are confirmed, many fewer but elucidative objec- 
tive items designed on the basis of analytical research as learning produc- 
ing, used to aid rather than replace books and other current instructional 
materials, will be found best. Most current programs and devices will then 
be made obsolescent; but this latter approach will be found both more widely 
effective and more practicable. 


SIDNEY 1. PRESSEY 
Ohio State University 


THE JOURNAL OF EDUCATIONAL RESEARCH 
(Volume 55, Number 9, June-July 1962) 


A Comparison of Three Modes of Presenting 
A Programmed Instruction Sequence: 


COMPARISONS between teac hing machines and 
programmed texts as presentation modes have been 
made by Eigen and Komoski (3), Holt and Hammock 
(6), Roe et al (8), and Silverman and Alter (1). In 
each of these studies, the machine was compared to 
either a horizontal programmed text orto a vertical 
programmed text. This study compares the ma- 
chine with both the horizontal and vertical text for- 
mats. 

A teaching machine presents anordered sequence 
of instruction to the learner one frame at a time. 
After responding to a stimulus frame, the learner's 
answer is immediately confirmed or corrected. 
The learner then proceeds to the next frame; he is 
prevented from changing his previous answers or 
going back in the program. 

In a horizontal programmed text, succeed ing 
frames appear on alternate pages. The learner 
writes his answer to the frame either in the рго- 
gram booklet or on a separate answer sheet. He 
then turns the page, and the correct response is re- 
vealed along with the next stimulus frame. A hori- 
zontal text is read from the front of the book to the 
back, across the pages, at one level. After com- 
pleting the top level, the learner reads across the 
text at the next lower level. This procedure is соп- 
tinued until the entire text has been read. 

Reading a horizontal programmed text involves 
much page turning. Allthe frames on any one level 
are allotted the same amount of space. This space 
limitation may influence the programmer's con- 
struction of his frames; i.e., he may tend to make 
all frames of a uniform length. In a sense, these 
Space limitations control the programmer's behav- 
lor. When frame lengths vary, shorter frames will 
‘waste’? space. From the standpoint of printing 
Costs, this is an important consideration. 

Vertical programmed texts are read from the 
top of the page to the bottom. Тһе number of 
frames appearing on one page may vary from one to 
lllo WA o s M ed 
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ten or twelve. A mask or slider is used with this 
mode of presentation. The learner exposes one 
frame at a time, together with the answers to the 
preceding frame, by moving the slider down the 
page. Aseach new frame is exposed, the learner 
writes his answer either in the textbook or on a 
Separate answer sheet. 

The vertical programmed text does not require 
as much page turning as does the horizontal text; 
the problem of space limitation is also avoided. In 
both the horizontal and vertical formats, however, 
the programmer's control over the learning se- 
quence may be easily broken by the learner's turn- 
ing ahead or back in the text, Neither of these 
modes of presentation is **cheat-proof''; both re- 
quire a maximum of self-control on the part of the 
learner. 

The question of control of the learner's behavior 
has been discussed in the literature for some time. 
Skinner (9) and Holland (7) have stated that the 
teaching machine gives more precise control over 
the behavior of the learner than. does the pro- 
grammedtext. Evans, Glaser, and Homme (4) have 
commented that with a well-written program, ma- 
chine control may not be necessary. Eigen (2), 
basing his stand on previous research, which has 
failed to demonstrate differential effects between 
presentation modes, has advocated that the pro- 
grammed-text format be used in initial field trials 
in school systems. 

Each of these points of view has merit. The 
learner using a teaching machine must follow the 
“рай” presented by the programmer. Therefore, 
with this mode of presentation, the learner’s be- 
havior is almost entirely under the control of the 
programmer. With programmed texts, this control 
may be diluted. The learner using a programmed 
text may deviate from the path set down by the pro- 
grammer by looking back in the program or looking 
ahead. However, this diminution of control may be 
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compensated for if the program is well-written, if 
it holds the learner's interest, if the subject matter 
is not overly difficult for the learner, and if the 
learner himself is sufficiently mature to exercise 
self-control. 

The major hypothesis of this study is: Mastery 
of subject matter is independent of presentation 
mode; i.e., there is no significant difference in 
mastery between machine, horizontal-text, or ver- 
tical-text presentations. 


Subjects 


: Seventy-seven eighth-grade boys and girls in the 
Hanover, New Hampshire, public schoo 1 system 
served as subjects for the experiment. Their IQs 
(Pintner) ranged from 73 to 142 with the mean at 
118. 

Because of the limited number of machines a- 
vailable for this study, 18 subjects were randomly 
selected for the machine presentation before the re- 
mainder were randomly assigned to the two pro- 
grammed-textbook treatments, Thenumber, mean, 
variance, УХ and УХ? of IQ scores for each of the 
three treatment groups—machine (M), horizontal 
text (H), and vertical text (V)— are given in Table 
I. Ananalysis of variance of the IQ scores indi- 
cated that the three groups were drawn from the 
same population. 


Program 


The program, ‘‘Numbers and Numerals,” * 
was mimeographed in three formats, on fan-fold 
paper for insertion in the Rheem-Didak teaching 
machine, as a horizontal programmed textbook, 
and as a vertical programmed textbook. The pro- 
gram consists of 65 frames. The machine-group 
Subjects were provided with a special panel which 
incorporated material from eleven answer frames 
that were too long to fit the machine. 


Procedures 


Subjects in the machine group worked individual- 
ly in sound-insulated booths inan annex of the Dart- 
mouth Medical School. Eachsubjectused a Rheem- 
Didak 501 teaching machine. Тһе day before the 
experiment began, the machine subjects were given 
a one-hour orientation lecture onthe use of the ma- 
chines by Dr. Edward J. Green of the Department 
of Psychiatry, Dartmouth Medical School. For 
both the orientation lecture and the experimental 
session, these subjects were transported to the 
Medical School annex by school buses. 

The V and H groups worked with the program- 
med-text formats in the cafeteria of the Hanover 


Junior-Senior High School, a familiar environment 
for allof these subjects. Each programmed-text 
group was instructed in the proper use of its own 
particular format. 

A ten-minute pretest consisting of true-false 
items and paragraph-correction items was admin- 
istered to all subjects before they were permitted 
to begin the program. During thefirstfive minutes 
of the experiment, faculty members were permitted 
to answer questions about the use of each format 
but not about the program's content. 

As each subject finished the program, his time 
was recorded. After all subjects had completed 
the program, a group posttest was administered. 
Four weeks after the completion of the program, 
all subjects were given the delayed posttest. 


Results 


This study tested the hypothesis that mastery of 
subject matter is independent of presentation mode; 
i.e., there is no significant difference in mastery 
between machine, horizontal-text, or vertical-text 
presentation. As summarized in Tables II through 
XI, the results of this study indicate that the null 
hypothesis cannot be rejected, 

The average pretest knowledge, in terms of the 
number of errors, was approximately the same for 
students tested with each mode of presentation (Ta- 
ble П). There are no significant differences among 
the groups in posttest achievement (Table III). Dif- 
ferences in gain (the number of posttest errors 
minus the number of pretest errors) were analyzed 
using a one-way analysis of variance. Again, no 
statistically significant difference between the 
groups was observed (Table VI). However, learning 
didtake place within each of the presentation modes 
(Table IX). The average gain (reduction of errors) 
was approximately four to five points. Within each 
of the presentation modes, the difference between 
mean pretest and mean posttest scores is statisti- 
cally significant beyond the . 001 level. 

On each of the two parts of the posttest, there 
are again no significant differences among the 
groups (Tables IV and V). 

The delayed posttest was administered four 
weeks after completion of the program. There- 
sults of this testing (Table VIII) again show no sta- 
tistically significant differences among the groups. 
However, it is interesting to note that retention is 
greatest for those individuals who usedthe machine 
and least for those who used the vertical-text pre- 
sentation mode. 

The experimenters in this study found that sub- 
jects using programmed texts complete the pro- 
gram in less time than those using machines. This 
is in keeping with previous studies comparing ma- 
chine with programmed-text presentation as re- 


* This program is a unit (Chapter 1, Book 1) in Modern Mathematics, L. D. Eigen, J. D. Kaplan, and R. 
Emerson, Chicago: Science Research Associates, Inc., 1961. 
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EIGEN 
TABLE I 
IQ 
M H у 
ax 2,097 3,231 3,760 
zx? 249,165 380, 361 463, 500 
X 116. 500 115. 393 121. 290 
о? 286. 147 278. 149 248. 320 
N 18 28 31 
F = . 323 N. S. 
1 way analysis of variance, 2 and 74 degrees of freedom 
TABLE II 
PRE-TEST TOTAL (PRE) 
M H v 
EX 180 269 316 
хх? 1,984 2,695 3, 484 
x 10. 000 9. 607 10. 194 
о? 10. 824 4.718 8. 757 
N 18 28 31 
Е-.343 N.S. 
1 way analysis of variance, 2 and 74 degrees of freedom 
TABLE III 
POST-TEST (PTT) 
M H у 
=x 89 160 162 
zx? 617 1,300 1,256 
X 4.944 5.714 5.226 
о? 10.411 14.287 13.646 
N 18 28 31 
= .323 N.S. 


1 way analysis of v 


ariance, 2 and 74 degrees of freedom 
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TABLE IV 
POST-TEST PART A (PTA) 


M H у 
=x 17 25 23 
=x? 39 51 45 
x 0.944 0. 893 0. 142 
а? 1. 351 1.062 0. 931 
N 18 28 31 

= .266 N. S. 


1 way analysis of variance, 2 and 74 degrees of freedom 


TABLE V 
POST-TEST PART B (PTB) 


M H v 
2x 72 135 142 
2Х 1.422 903 930 
x 4. 000 4. 821 4. 581 
о? 7. 882 9. 339 9. 317 
N 18 28 31 

F = .457 N.S. 


1 way analysis of variance, 2 and 74 degrees of freedom 


TABLE VI 
GAIN* 
=x 267 451 475 
DX? 4155 1665 1829 
X 14. 83 16.11 15.32 
o? 11.441 14. 840 18. 359 
N 18 28 31 
F =. 622 N. S. 


1 way analysis of variance, 2 and 74 degrees of freedom 


*20 has been added to each gain score to facilitate computa- 
tion. The absolute gains in general are negative since there 
was a reduction in number of errors from PRE to PTT. 
Thus Xj = 20 + PT1j - PREi 
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TABLE VII 


TIME (IN MINUTES) 


M H v 

zx 541 768 833 

DX2 16,593 21,468 23,297 
x 30.05 21,43 26.87 
о? 19.59 14.92 30. 45 

N 18 28 31 
t= 2.02 %-2.17 

ғ-2.742 N.S. 


1 way analysis of variance, 2 and 74 degrees of freedom 


TABLE VIII 


DELAYED POST-TEST (DPT) 


M H M 
УХ 108 182 249 
Руа 996 1814 2709 
х 6.35 7.28 8.03 
о? 18. 233 19.536 22.944 
N 17 25 31 
F-.728 N.S. 


1 way analysis of variance, 2 and 70 degrees of freedom 


TABLE IX 


COMPARISON OF PRE-TEST AND POST-TEST 


M H v T 
Mean Pre- Test 10. 000 9. 607 10, 194 9.935 
Mean Post-Test 4,944 5. 714 5.226 5.337 
-5. 056 -3. 893 -4. 968 -4,598 


Difference 


4. 657 4. 804 5. 843 8.958 
27 30 76 


t* 


Degrees of Freedom 17 


Level of Significance P<. 001 P<. 001 P<. 001 P<. 001 


* The null hypothesis tested by t is: The mean PRE is equal to the 


mean PTT. 
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viewed in Goldstein and Gotkin (5). 

Although the three presentation modes do not dif- 
fer significantly in amount of time taken to com- 
plete the program (Table VII, subjects using the 
horizontal or the vertical-textformats took less 
time than did those who used machines. 

Although the program used in this study is brief " 
there is a wide variation in time taken to complete 
the program (Figures 4 and 5). With a longer pro- 
gram, the time differences between the various 
modes of presentation may become even more dis - 
parate. 

The performance of the vertical-programmed- 
text subjects on the posttest and delayed posttest is 
relatively independent of IQ and reading level, as 
evidenced by the correlations (Table X). However, 
for the machine and horizontal-programmed-text 
groups, IQ and reading level are Significantly re- 
lated to immediate and delayed posttest perform- 
ance. The difference between machine and vertical- 
text format in correlation of IQ with posttest per- 
formance is significant (p<. 05). These findings 
suggest an interaction effect of IQ and presentation 
mode. Table I shows the two-way analysis of vari- 
ance (IQ, split at the mean, versus machine and 
vertical-text modes) which yields an interaction 
approaching significance (. 10>р>.05). It is sug- 
gested that further research geared specifically to 
testing this effect is necessary in order to resolve 
the issue, 


Discussion and Conclusions 
ee eee onciustons 


This study demonstrates, as have previous in- 
vestigations, that differences between machine and 
programmed-textbook presentation modes are not 
reflected in mastery of programmed subject matter, 
In addition, this investigationhas shownthat differ- 
ences between the vertical and the horizontal-text 
formats are not evidenced in differences in learn- 
ing. The time-saving feature of programmed texts 
as compared with machine presentation, as demon- 
Strated in previous studies, is shown once again to 
exist. 

The loss of the programmer's control over the 
learner's behavior in the textbook versions of the 
program does not seem to hamper mastery of the 
programmed material. Either the programmer’s 
control рег se is not important, or else the dimi- 
nution of the programmer's control in the text ver- 
sions has been compensated for in some manner. 

Observations of the subjects as they were work- 
ing with the program and discussions with them af- 

ter the program had been completed indicate that 
difficulties were encountered with two modes of 
presentation. Those who worked with machines in- 
dicated that the equipment occasionally malfunc- 
tioned and that other operating difficulties occurred. 
Horizontal-text subjects complained chiefly about 
the excessive amount of page turning necessary to 
go through the program. Мо complaints were elic- 


ited from those subjects who used the vertical-pro- 
grammed-text format, nor was any undue difficulty 
with this version observed by the partic ipating 
teachers. 

Past research has indicated that “feedback 
Should be immediate for maximally efficient learn- 
ing to take place..." In this investigation, the 
most immediate feedback would be effected with the 
vertical-programmed-text format since the subject 
need only move his slider down the page to reveal 
the answer to the previous frame. In this format, 
information is generally most easily accessible; 
previous and future frames and their answers are 
easily revealed. With the horizontal-text format, 
some searching through the text is involved if the 
subject wishes to ‘‘review. ”’ Accessibility of in- 
formation is quite restricted in the machine mode, 
as employed in this study, since only one frame at 
a time is revealed and the subject cannot go back in 
the program nor can he go ahead until he responds 
to the frame on which he is working. The relation 
of accessibility of information to mastery of pro- 
grammed subject matter deserves further intensive 
study. 

Results obtained in this study were based on the 
administration of a rather brief program. It would 
be interesting to discover whether any differences 
in a program of increased length would yield results 
at variance with those observed. The fact that dif- 
ferences inrateoccur with a program as short as 
the one used indicates that classroom administra- 
tive difficulties encountered in the utilization of 
long programs will also obtain with shorter pro- 
grams. 

This study supports the view that both vertical 
and horizontal-text presentation modes are as effi- 
cient and effective as machines in bringing about 
mastery of programmed subject material. In ad- 
dition, although the difference is not statistically 
significant, the text modes produce a saving in 
time. 

Since no significant differences, in mastery and 
time, are obtained between the horizontal and ver- 
tical-text formats, the question of which format to 
select must be resolved on the basis of other cri- 
teria. The vertical-text format does not restrict 
the programmer in the size of the frame he wishes 
to produce; it entails less page turning for the 
learner; and it is less expensive to produce from 
the standpoint of printing costs. Except for pro- 
grams which incorporate a large number of copy- 
ing frames, such as those which teach spelling, it 
seems reasonable to suggest that programs be pub- 
lished preferably in the vertical-text format. 


Summary 


The primary interest of this study was centered 
on the relation between presentation mode and mas- 
tery of programmed instructional material. 

Seventy-seven eighth graders with mean IQ of 


EIGEN . 


TABLE X 
MATRIX OF CORRELATION COEFFICIENTS FOR M, H, AND V GROUPS AND TOTAL SAMPLE 


3 — 0—À ——— — = 
.933** -[nes* ED -fAno* -,521% / 639* М -{162* 
S 885** 4 581** -.424* {йт -1269 514+ Sage suae 
.894** -2260 -.113 -3147 -.439* -M66 -^246 2 150 
.897%% -.479% -.222 -.294 -.400%% -.426** -.422** si 416** 
—} 
.787%ж -.468 -.344 -. 480* -[648* | - ы - ж 
Reading e -.526** -.301 -.302 (ee. - ER - Mie 
Level M31 -.221 . 040 -.421% -1094 -.106 -. 067 
-.449** a -.366** -.166 -.397** -.413% -.389%% -.396%% 
Post- .493* .495* nui .517* .935** .T23** 
Test .232 .859** .293 PALE Si 4911%% ,534** 
Total .194 .132%% -.069 .653%% .984** .615** 
.264* ‚140** .076 .657%% 1 .971%% .600% 
Рге- -.502% -.020 .400 .401 ‚260 
Test -.298 .022 .192 .222 ‚201 
Total -.526%% .161 4147 .209 -,034 
-.451** .077 .227* .250* ‚110 
.216 .113 .522% .481% 
Gain .216 .656%% .842** .418% 
Total -.171 .464% ."08** ‚551** 
.032 ‚451** ‚122** .481%% 
-.045 .223 „457 
(In each cell the upper coefticient is for Time .319 .255 + 266 
group M; the second for group H; the third =, 025 -. 074 +009 
for group V; and the fourth for the total +087 -059 32 
sample.) 
* Significant beyond the . 05 level. T 
** Significant beyond the 101 level. Post- .180 .514% 
A black line connecting two correlation со- Test .617%% . 530** 
efficients in the same cell indicates that Part A .551%% ‚350 
the correlation coefficients differ signif- .415%% ‚434** 
icantly beyond the . 05 level. қстан та VI va NH 
Post- .618* 
Test .482%% 
Part B ‚511** 
.552%% 


Delayed Post- 
Test Total 


TABLE XI 


MEAN POST-TEST SCORES FOR M AND V GROUPS 
SPLIT INTO HIGH AND LOW IQ GROUPS* 


Source of Variation Sum of Squares dt Mean Squares F 

IQ 26. 03 1 26.03 2,18% 
M and V Groups .91 1 .91 . 08 
Interaction 23.71 1 23.71 1.99** 
Within 536. 62 45 11.92 

Total 587.27 48 


ж The mean IQ (118) was used to split the group. The high IQ group includes all subjects with IQ 


'eater than 118. 2 d 
xx Both of these critical ratios are “marginally” significant. That is, .10>P>.05. 
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118 were assigned to machine, horizontal-text, and 
vertical-text modes of presentation. The 65-frame 
program used was designed toteachdifferences be- 
tween numbers and numerals. 

Two major findings evolve. Inter-mode differ- 
ences in learning are not statistically significant. 
Subjects using the programmed texts complete the 
program іп less time, although not significantly so, 
than do subjects who use the machines. 

It is suggested that the interaction between IQ 
and presentation mode, and accessibility of infor- 
mation be studied more closely. An additionalsug- 
gestion is made that, except for programs consist- 
ing mainly of copying frames, the vertical-pro- 
grammed-text format be used. 


REFERENCES 


1. Alter, M. and Silverman, R. E. Re sponse 
Mode, Pacing, and Motivational Effects in 
Teaching Machines, New York: Department of 
Psychology, New York University, 1961. 

2. Eigen, L. D. *Some Problems in Field Testing 


Programs for Teaching Machines, " The Jour- 
nal of Educational Sociology, 1961, 34 (8), рр. 
372-376. 
Eigen, L. D. and Komoski, P. K. Research 
Summary Number 1, New York: Automated 
Teaching Project, Collegiate School, 1960, 
Evans, J. L., Glaser, R., and Homme, L.E, 
An Investigation of ‘Teaching Machine’ Vari- 
ables Using Learning Programs in Symbolic 
Logic, Pittsburgh: Department of Psychology, 
University of Pittsburgh, 1960. 
Goldstein, L. S. and Gotkin, L. G. Review of 
Research: Teaching Machines Versus Pro- 
grammed Textbooks as Presentation Modes, New 
York: Centerfor Program ed Instruction, 1961. 
Hammock, J. and Holt, H. O. Booksas Teach- 
ing Machines : Some Data, Murray Hill, N.J.: 
Bell Telephone Labs, 1961. 
Holland, J. G.‘‘Evaluating Teaching Machines 
and Programs,’’ Teachers College Record, 
1961, 63 (1). pp. 56-65. 
Leeds, D., Massey, M., Roe, A., and Weit- 
man, G. Automated Teaching Methods Using 
Linear Programs, Los Angeles: Automate 
Learning Research Project, Department of En- 
gineering, University of California, 1960. 
Skinner, B. F. “Teaching Machines," Science, 
1958, 128 (3330), pp. 969-977. 


:2 


THE JOURNAL OF EDUCATIONAL RESEARCH 
(Volume 55, Number 9, June-July 1962) 


A Study of Nine Methods of Presentation of 


Programmed Learning Material 


SOME OF THE RESEARCH on teaching machines 
and programmed learning has attempted simultane- 
ous comparisons of several devices and variations 
of programmed material within one experimental 
arrangement. Thus Roe et al. (3) used а mechani- 
cal teaching machine, a simple masking device, 
constructed responses, multiple choice responses, 
programmed text, a program with blanks filled in, 
and programmed lectures and found no significant 
difference in learning among the different methods. 

The present study is like the latter in that nine 
experimental variations in the type of device and 
the manner of programming were used. A single 
hypothesis was tested: there will be no differences 
in the amount learned among the experimental 
groups, all of which will be taught with material 
which is in the style of a linear program — or 
Some form derived from the linear program — and 
which will be taught with a mechanical teaching ma- 
Chine, with a simple masking device, or with no 
device at all. 


Procedure 


Thirty college students from regular classes in 
general psychology served as subjects in each of 
the experimental and control groups. The ratio of 
men to women was approximately three to five. 
Seven of the nine experimental groups and both con- 
trol groups were tested and treated as classroom 
groups in a classroom setting; two groups received 
WE instruction, as will be described 
ater. 


JOHN F. FELDHUSEN 
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The Teaching Materials and Tests 


One general topic with eight basic concepts was 
selected to be taught in all the experimental and 
one of the control groups. This topic was con- 
cerned with teaching machines and programmed 
learning. It was selected because a quite uniform 
freedom from previous learning was found in col- 
lege sophomores (not used as subjects in the study) 
who were given a special fifteen-item pretest on 
knowledge of teaching machines and programmed 
learning. 

The linear program developed for use in this 
study consisted of 37 frames with one to five 
blanks per frame and a total of 55 blanks in the 
whole program. Individual error ratios ranged 
from two percent to 27 percent, with a mean of 
14 percent for a group of 30 subjects. The con- 
cepts covered in the program included: stimulus 
and response as they relate to a program; the na- 
ture of the device; the segmentation of learning 
material; the arrangement of frames in a program; 
the need for active participation of the learner; the 
role of feedback; the importance of low error ra- 
tio; and the importance of individual rates in learn- 
ing. This program served as the base from which 
experimental variations were derived. 

All of the groups except one control group were 
pretested with a fifteen-item test of knowledge of 
concepts related to teaching machines and pro- 
grammed learning. This test was used again as 
the post-test for all groups. The test was devel- 
oped to measure learning of the concepts covered 


in the program. 
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Treatment of Experimental and Control Groups 


GrouplA. Subjects in this group came to the 
testing room one at a time and were instructed as 
individuals on mechanical teaching machines. The 
Min/Max machine manufactured by the Grolier 
Corporation was used for half of the group, and 
Teaching Machine Number 2002 manufactured by 
Foringer and Company was used for the other half. 
Each subject came alone to the testing room and 
was given instructions individually. The standard 
linear program typed on 8 1/2 x 42 inch stock for 
the Foringer Machine and on 8 1/2 x 11 inch stock 
for the Min/Max was used. This included blanks . 
in the program, constructed responses, immediate 
feedback of answers, low error ratio, and learn- 
ing at individual rates. Simple, standardized in- 
Structions were used here, as in later treatments, 
merely to show the subject how to use the device 
or how to proceed. Тһе fifteen-item test was ad- 
ministered individually immediately before and 
after the learning sequence. 


Group 1B. Subjects in this group came to the 
testing room one at a time. А simple substitute 
teaching device consisting of a manila folder with 
an opening for an exposure frame and a second 
opening with flap to serve as a feedback frame was 
used to present the program. The subject was іп- 
Structed to open the feedback flap after respond- 
ing to each frame and to close it after noting the 
correct answers. He was also shown how to move 
the program through the folder and how to write 
his responses on a separate sheet which was pro- 
vided. The program, typed on 8 1/2 x 13 inch 
Stock, consisted of three pages. Thus, it was 
necessary to remove the program from the folder 
after completing the last frame on a page, fold 
back that page, and reinsert the program with the 
new page up in the device. The program was 
otherwise identical to the standard linear program 
used in 1А. Again the students worked at individ- 
ual rates, and pretests and post-tests were ad - 
ministered individually. 


Group 2. This group was tested and taught as а 
group in the classroom setting. The same substi- 
tute teaching device described in 1B, and the same 
linear program described in 1A and 1B, were used. 
The program was in the form described in 1B to 
facilitate its use in the manila folder. The sub- 
jects worked at individual rates and were pretest - 
ed and post-tested. Тһе post-test was given as 
soon as a subject finished the program. 


Group ЗҮ. This group was taught with the 
same linear program used for Groups 1A, 1B, 
and 2 and with the same folder type of teaching 
device used for Groups 1B and 2. However, no 
feedback of right answers was offered, the answer 


Strip having been removed from the program. 


Group 3Z. This group was treated in exactly 
the same way as 3Y except that the feedback was 
offered but only after the subject had worked 
through the complete program. An answer sheet 
was given to the subject as he completed the pro- 
gram, and he was told that he could check his re- 
sponses against this key. 


Group 4. This group was taught with the same 
linear program as that described for previous 
groups. However, the blanks were already filled 
with correct responses. Actually, no semblance 
of blanks appeared, since this program was re- 
typed to look like a series of statements. The 
folder teaching device was used in a modified form 
which included only an exposure frame. Subjects 
were instructed to read each frame, worked as a 
group in a classroom setting, proceeded at indi - 
vidual rates, and were pretested and post-tested 
immediately before and after the program. 


Group 5C. This group used exactly the same 
program as Group 4 with blanks filled in, but the 
folder teaching device was not used and subjects 
were merely given the material and told to read it. 
АП other aspects of the classroom setting, indi- 
vidual rates, and pretest and post-test were the 
same as for Group 4. 


Group 6D. This group was taught with special 
narrative material which was derived from the 
program. Each concept in the basic program was 
taught with three to six frames, and those frames 
which taught one concept were put together as a 
paragraph with the blanks filled in. Minor changes 
were needed in wording to create continuity or to 
avoid obvious repetition. Тһе resulting paragraphs 
were assembled in the same order as the concepts 
were taught in the program. This was then pre- 
sented to the subjects with simple instructions to 
read the material. Again the subjects were treat- 
ed as a group in a classroom, but each worked at 
his own individual rate. Тһе pretest and post-test 
were administered immediately before and after 
the narrative material. 


Group 8. This group was treated in exactly the 
same way as Group 2 except that their rate of 
progress through the program was paced, not in- 
dividual. Initial trials with a group not used in 
this study revealed mean times per frame of thirty 
to fifty seconds when subjects worked at their own 
rates. The time per frame used in this experi- 
mental treatment was derived from those means. 
The examiner gave instructions at the beginning of 
the program that it would be necessary for all 
subjects to move on to the next frame when time 
was called and that no one was to advance to the 
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next frame before time was called. 


Group 7 (Control). No instruction or treatment 
of any kind was administered to this group. Atthe 
beginning of a regular class period the pretest was 
administered, and thirty minutes later the post- 
test was administered. Тһе regular instruction of 
the day was offered during the interval, and this 
instruction had nothing to do with teaching ma - 
chines and programmed learning. Тһе purpose of 
this control group was to determine how much 
learning would occur from the pretest. 


Group 9 (Control) The treatment was identical 
with that of Group 2 except for the omission of the 
pretest. The purpose of this control was to ascer- 
tain the effects of the pretest in creating a set to 
learn in the subsequent treatment. 


Results 


The Bartlett test for homogeneity of variance 
was used with the pretest and the post-test scores 
(1, p. 195) for the nine experimental groups. The 
chi squares of 6. 68 and 12.12 for the pretest and 
post-test scores, respectively, are not significant 
atthe five percentlevel. Thus, it appears safe to 
assume that the experimental groups were drawn 
randomly from populations with common variance. 

Inasmuch as the correlation between pretest and 
post-test scores, .260, was significant at the one 
percentlevel, and no attempt was made to match 
groups on the pretest, an analysis of covariance 
design (1) was employed to test the hypothesis that 
there would be no differences among the post-test 
means of the nine experimental groups. 

The pretest means, the post-test means, and 
the standard deviations of pretest and post-test 
Scores for the nine experimental and two control 
groups are reported in Table І. Тһе pretestmeans 
varied from 3.40 for Experimental Group 1B to 
4.63 for Experimental Group 3Z. An analysis of 
Variance for the ten pretest means as reported in 
Table II revealed no significant difference. The 
post-test means varied from 11.43 for Experi - 
mental Group 4 to 12.50 for Experimental Group 
lA. An analysis of variance for the nine experi- 
mental groups is reported in Table III. The F of 
:99 between groups is not significant. 

An analysis of covariance was then used as à 
further test for differences among the post-test 
means which might be attributable to differences 
àmong the pretest means. From this analysis of 
Covariance, reported in Table IV, itis seen that 
there is no significant difference among the ad- 
justed post-test means. The hypothesis of no dif- 
ferences among the means of the experimental 
8roups is therefore supported. 

Control Group 7 was used to ascertain the learn- 


ing increment which might be attributed to the pre- 
test. This was of particular concern because the 
same fifteen-item test was used as both pretest and 
post-test. This group received no instruction, but 
a thirty minute interval elapsed between the two 
tests, during which a normal classroom lecture 
unrelated to teaching machines and programmed 
learning was given. The pretest mean was 4. 00, 
the post-test mean, 4.03. The difference of .03 
was found not to be significant. 

Control Group 9 was used to test the power of 
the pretest to induce a set to learn. This control 
effort was not critical since all the experimental 
groups received the pretest. It may be noted that 
the resulting post-test mean of 12. 50 for this Con- 
trol Group 9 was slightly higher than the post-test 
mean of 12. 33 for Experimental Group 1B which 
was treated in exactly the same way as Group 9 
except that 1B received the pretest. 


Discussion 


The results of this study are in agreement with 
those reported by Roe et al. (3) in that no signifi- 
cant difference was found between Group 1A which 
used the standard linear program in a teaching ma- 
chine or Group 1B which used the standard linear 
program in a folder-type device and Group 4 which 
used a linear program with blanks filled inand pre- 
sented in a folder device. The means of the first 
two groups were 12. 50 and 12. 33, while the mean 
for the group whose programs had the blanks filled 
in — and who therefore had no active participation 
— was 11.43. Roe and his associates used as one 
experimental variation a programmed textbook 
with blanks filled in. The programmed text and 
the procedure used in this study were not identical, 
in that here the subject read each frame through 
an opening in the folder device and also moved the 
program through the folder by sliding it upward. 
Yet the two approaches are similar in requiring no 
active participation of the subject. 

The folder device used in this study to present 
the linear program may be considered similar to 
the simple maksing device used by Roe et al. How- 
ever, their masking device apparently did not af- 
ford a simultaneous view of the stimulus frame 
and the correct responses as was the case with the 
folder device used in this study. Nevertheless, the 
results of the two studies agree in finding no sig- 
nificant difference between groups who were taught 
with standard linear programs in mechanical ma- 
chines and groups taught with simple folder devices. 

The favorable learning effects obtained with the 
material used in Groups 4, 5С and 6D шау result 
from the original analysis and care in preparation 
of the material as a program from which the state- 
ments and paragraphs were derived and from the 
constant attention to behavioral objectives in the 
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TABLE I 


MEANS AND STANDARD DEVIATIONS FOR ALL EXPERIMENTAL AND CONTROL GROUPS 


TF _ eee 


S.D. S.D. 
Experimental Group Post-test Pretest Post-test Pretest 
Means Means Scores Scores 


__—_-——————-—_. 


1A Standard linear program on 
mechanical machine, done as 
individuals at individual rates 12.50 3.97 2.01 1.84 


1B Standard linear program on 
folder teaching device, done as 
individuals at individual rates 12.33 3. 40 2.42 1.56 


2 Standard linear program on 
folder teaching device, done as 
a group at individual rates 12.40 4. 40 1.70 1.60 


ЗҮ Standard linear program on 
folder teaching device, done as 
a group at individual rates, no 
feedback offered 11.77 4.27 2.24 1.47 


32 Standard linear program on 
folder teaching device, done as 
a group at individual rates, all 
feedback at end of program 11.93 4. 63 1. 62 1.68 


4 Standard linear program with 
blanks filled in, presented in a 
special folder device with a viewing 
frame but no feedback frame 11.43 3. 80 1.94 2.09 


5C Standard linear program, blanks 
filled in, subjects read program, 
no device used, done as group at 
individual rates 12.40 4.20 1.52 1.82 


6D Linear program material restored 
to paragraph form, no device, no 
blanks, done as a group at individual 
rates 11.97 3.67 1.67 1.57 


8 Standard linear program on folder 
teaching device, group paced 12.13 4.00 2.23 1.81 


7 (Control) Just given pretest and 
post-test 4.03 4.00 1.99 1.65 


9 (Control) Same treatment as Group 
2 but no pretest 12.50 Eust 1.78 аф 


5 —— JM ————— ÀBBGG n! 
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TABLE II 


ANALYSIS OF VARIANCE OF PRETEST SCORES FOR ALL GROUPS EXCEPT CONTROL GROUP 9 


== € 


Source of Variation Sum of Squares df Mean Square Dev. F 
„л ЕЕЕ 

Between groups 35 9 3.89 1.27 

Within groups 887 290 3.06 

Total 922 299 


TABLE III 


ANALYSIS OF VARIANCE OF POST-TEST SCORES FOR ALL EXPERIMENTAL GROUPS 
NOT INCLUDING CONTROL GROUPS 7 AND 9 


Eoo], OM ТТТ 


Source of Variation Sum of Squares df Mean Square Dev. F 
HEN 0 02 2---------2-2-----2203л4 

Between groups 31 8 3.88 .99 

Within groups 1024 261 3.92 

Total 1055 269 


Шы е 


TABLE IV 


ANALYSIS OF COVARIANCE OF POST-TEST SCORES FOR ALL EXPERIMENTAL GROUPS 


s o Oe — — — c uam US ал 


Source of Variation Sum of Squares df Mean Square Dev. F 
? 1.03 
Adjusted means 30 8 3.75 0 
Within groups 949 260 3.65 
979 268 


Total 


ШИ —— cC ere 
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preparation of the original programmed material. 
The latter may, of course, be conducive to good 
test performànce. 

Groups ЗҮ and 32 did not receive the usual 
feedback and Group 8 worked at a group pace. 
However, these three groups seemed to learn as 
effectively as the groups who worked with the 
standard linear program. The success of Group 8 
may be of particular interest to teachers who con- 
template using programmed material in a form 
which requires group pacing. The film strip, slide 
and overhead projectors may be of interest for 
such presentations. 

The success of Groups 3Y and 3Z may be of in- 
terest to teachers and researchers who question 
the need for frame-by-frame feedback as offered 
in the standard linear program. The need to check 
responses at each frame may be unnecessarily 
time-consuming and may not enhance learning more 
than occasional or less frequent checking of a num- 
ber of responses for several frames. 

While the sample of programmed material used 
in this study was very short, this afforded the ad- 
vantage of control in that the learning sequence 
could be completed in one session. In a longer 
learning session the differences among groups 
might have been greater, and a clearer picture of 
relationships among the methods might have been 
afforded. 

Further investigations are under way to test the 


effectiveness of other variations in the program 
and the manner of presentation. 


Summary 


Nine experimental groups of thirty subjects 
each received instruction with nine variations of 
programmed material to test the hypothesis that 
there would be no differences in learning among 
the groups. Analysis of covariance was used inas- 
much as pretest scores were correlated with the 
post-test scores which were used to test the hypo- 
thesis. Тһе difference among experimental groups 
was not significant; thus the hypothesis was upheld. 
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An Analysis of the Efficiency and Effectiveness of 
Selected Aspects of Machine Instruction 


STUDIES investigating the comparative efficien- 
cy and effectiveness of teaching machine instruction 
have been carried out at several levels of instruc- 
tion, using various subject matter contents. Feres- 
ter and Sapon (3) reported that the achievement of 
students in a small informal class using teaching 
machines in the study of German was comparable 
to the achievement of students in a conventional 
college German class. Though Ferester and Sapon 
report that little instructional time was saved by 
the experimental group as a result of the use of 
teaching machines, it was assumed that the students 
in the conventional German class spent a consider- 
able amount of time in outside preparation whereas 
the teaching machine instructed group did not. 

McNamara and Hughes (5) reported that a class 
of IBM customer engineers usingteaching machine 
instruction in а 7070 training course scored signif- 
icantly higher on a criterion test than another group 
taught the same content by means of the lecture 
discussion method. Inaddition, a substantial sav- 
ing of instructional time was foundin favor of the 
machine instructed group. 

Porter (6) reports a study іп which two groups 
of sixth-grade students learned spelling, one by 
means of teaching machines and the other through 
conventional methods of spelling instruction. The 
achievement of the machine instructed group Was 
found to be significantly greater. A significant 
Saving of instruct ional time favoring the machine 
instructed group was also found. In another study 
reported by Roe (7), the content of mathematics 
was taught to freshmen engineering students. The 
machine taught groups were found to perform sig- 
nificantly better on the criterion test than the 
groups taught by the lecture method. This study, 
however, reports that the lecture group took less 
mean time to complete the course than the machine 
instructed group. 

At least four studies have been designed to yield 
data regarding the comparative effectiveness of 
constructed and selected modes of response to 
teaching machine programs. Ina study in which 
short deductive proofs, similar to those used in 
symbolic logic, were taught to two groups by means 
of the constructed and selected modes of response, 
Evans (2) found no significant differences in a- 
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chievement of students taught by the two pr ogram- 
ming techniques. Іп а study using the content of 
introductory psychology, Coulson and Silberman (1) 
report no differences on the criterion test between 
students taught by means of constructed and select- 
ed response programs. Roe (7) indicates similar 
findings in a study using the content of mathematics 
taught to college greshmen. Fry (4) on the other 
hand found that in the teaching of Spanish vocabu- 
lary to high school students, the constructed re- 
sponse mode was more effective when the criterion 
of learning was recall. No significant differences 
could be found, however, when the criterion test 
was of the multiple choice variety. 


The Problem 


This study was designed to add to the body of 
research surveyed above by investigating the effi- 
ciency and effectiveness of severalaspects of teach- 
ing machine instruction, using as the content taught 
the historic foundations of the secondary school 
curriculum. Specifically, the study dealt with the 
following three areas: 1)the comparative effec- 
tiveness of teaching machine instruction and the 
conventional lecture-discussion method of instruc- 
tion; 2) the comparative efficiency of teaching ma- 
chine instruction and the conventional lecture-dis- 
cussion method of instruction, 3) the comparative 
effectiveness of the constructed and selected re- 
sponse modes and a combination of these two modes 
used in succession. 

The five hypotheses tested, stated as null hy- 
potheses, were as follows: * 

1. There is no difference in the amount of learn- 
ing produced as а result of theteaching machine in- 
structional technique and the conventional lecture- 
discussion instructional technique. 

2. There is no saving of instructional time as а 
result of the use of the teaching machine instruc- 
tional technique as compared with the conventional 
lecture-discussion instructional technique. 

3. There is no difference in the amount of stu- 
dent learning produced as a result of instruction 
from constructed response programming and select- 
ed response programming. 

4. There is no difference in the amount of stu- 
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dent learning produced as a result of instruction 
from constructed response programming and com- 
bination, selected-constructed, response program- 
ming. 

5. There is no difference in the amount of stu- 
dent learning produced as a result of instruction 
Írom selected response programming and combina- 
tion, selected-constructed, response programming. 


The Procedure 


This study involved the use of an experimental 
group (teaching machine instruction) and a control 
group (lecture-discussion instruction) formed from 
forty-one junior and senior students in the Division 
of Secondary Education, Temple University, enrol- 
led in the course “Тһе Contemporary Secondary 
School”’ for the fall term, 1960. The content taught 
during the course of the experiment involved the 
historic foundations of the modern secondary school 
curriculum, the content typically taught during the 
first three to four weeks of this course. 

The experimental aspect of the study involved 
three phases. Тһе first phase included the devel- 
opment of the teaching machine programs and eval- 
uation devices to be used in the study. Тһе second 
phase included the gathering of data necessary to 
Structure equivalent groups on the criteria used for 
this purpose, verbal ability and knowledge of con- 
tent to be taught. The third phase involved the in- 
Struction of the groups and the gathering of data 
necessary to draw conclusions in terms of the hy- 
potheses stated for the study. 

Phase one took place during the summer of 1960. 
During this time, the instructional program to be 
used in the teaching machines was written, edited 
and transcribed to the paper stock to be used in the 
machines. The major evaluation device, a one- 
hundred and fifty question true-false test, was also 
constructed and piloted on a summer session edu- 
cation class. As a result of the trial of the test, 
obvious and ambiguous questions were eliminated 
or rewritten and the final instrument was prepared. 
Two fifty-question true-false quizzes to be adminis 
istered at the end of the first and second weeks of 
instruction were prepared by rewording questions 
from the major test. These quizzes covered con- 
tent to be taught during the first and second weeks 
of instruction. 

Content validity was established for the instru- 
ments by insuring that questions were based on the 
content of the instructional program and lectures 
and that questions were selected in equal number 
from content to be taught in each instructional ses- 
sion and by the three response modes to be used. 

The reliability of the instruments was establish- 
ed by the use of a Kuder- Richardson test for inter- 
nal consistency reliability for both the major, one- 

hundred and fifty question test and the two fifty- 
question quizzes. These coefficients of reliability 
are reported in Table I. 


Phase two took place during the first week of the 
fall semester, 1960. During this time, the V-score 
of the College Entrance Examination Board, Scho- 
lastic Aptitude Test was obtained for each student 
registered for the course, “Тһе Contemporary 
Secondary School". In addition, these students met 
for the first session of the course inaroom re- 
served for testing, where they took the one-hundred 
and fifty-item major test as a pre-test of knowledge, 

Four test scores were used for equating the 
groups: 1) the V-score of the College Entrance Ex- 
amination Board, Scholastic Aptitude Test, 2) the 
total pre-test score, 3) the first week score of the 
pre-test (test items on knowledge to be taught the 
first week of instruction), 4) the second week, pre- 
test score (test items on knowledge to be taught 
during the second week of instruction). The exper- 
imental group (N = 20) and the control group (N -21) 
were then structured in such a way that there were 
no differences in mean score or variability on the 
four measures used as veriables for equating the 
groups. The mean scores and standard deviations 
for both appear in Table II. 

After the experimental and control groups had 
been structured, the students met again as a total 
group at which time they received assignment to 
their respective groups and were given information 
and instructions regarding the experiment. At this 
time the students were asked not to use outside ref- 
erences during the course of the experiment and not 
to discuss the content of the subject matter being 
taught with other members of the class. Therewas 
no indication that any student violated this request 
for cooperation. In addition, students in the con- 
trol group were asked to keep a record of the time 
they spent studying lecture notes outside of class. 

Phase three included the three and a half weeks 
during which the experimental and control groups 
received instruction and during whichtime the eval- 
uation of learning efficiency and effectiveness took 
place. 

The control group met at the regularly scheduled 
class time for the course for a total of 545 instruc- 
tional minutes distributed over seven class meet- 
ings. At this time they received instruction by 
means of the lecture-discussion method of instruc - 
tion. Care was taken by the experimenter, the lec- 
turer for the control group, tocover withthis group 
the same content with the same emphasis as this 
content was being covered inthe experimental group. 
Members of the experimental group were assigned 
to three instructional sessions each week for three 
weeks for a total of nine instructional sessions. 
The pattern of response modes used in each of the 
nine instructional sessions is shown in Chart 1. 

At the end of the first and second weeks of in- 
struction, both groups were given a fifty-question 
true-false quiz. The first week quiz was unan- 
nounced and a survey of the control group indicated 
that no student had studied for the quiz. The second 
week quiz as well as the final post-test were an- 
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CHART 1 


PATTERN OF RESPONSE MODES USED IN THE 
MACHINE-INSTRUC TIONA L SESSIONS 


First Session Second Session Third Session 
First Week Constructed Selected Combination 
Response Response Response 
Second Week Combination Constructed Selected 
Response Response Response 
Third Week Selected Combination Constructed 
Response Response | Response 
TABLE I 


COEFFICIENTS OF INTERNAL CONSISTENCY RELIA VILITY 
FOR THE EVALUATION INSTRUMENTS USED 


| 
Internal Consistency 
Reliability 
Major Test (150 Items) 219 
First Week Quiz (5о Items) .49 
| Second Week Quiz (50 Items) .66 
TABLE П 
COMPARISON OF THE CONTROL AND EXPERIMENTAL GROUPS ON THE 
FOUR TEST SCORES USED FOR EQUATING THE GROUPS 
Control Exp. Group t Level of Control Exp. Group F Level of 
GroupM M Sig. Groupo с Sig. 
Total Pre-test Score 85.24 85.25 . 004 n.s. 6.78 5.91 1.15 n.s. 
First Week Score 31.19 31.15 . 038 n.s. 3.32 3.87 1.16 n.s 
Second Week Score 27.57 27.50 . 038 n. 5. 3.86 3.74 1.03 ns. 
V-score 490. 43 492.50 .09 n. S. 72.28 74.28 1.03 п.8. 
ТАВІЕ Ш 
О WEEKLY 
OMPARISON OF THE CONTROL AND EXPERIMENTAL GROUPS ON THE TW! 
eee QUIZZES AND THE FINAL POST-TEST OF KNOWLEDGE 
lof 
у 1 Exp. Grouj t Level of Control Exp. Group F Leve 
de M 3 Sig. Groupo c Sig. 
First Week Quiz 38.81 41.80 2.74 <.05 4.47 2.24 3.98 <.01 
Second Week Quiz 42.71 43.35 .49 n.s. 4.47 3.46 1.67 n.S. 
Final Post-test 118.81 118.95 .03 n. sS. 12.08 9.22 1.72 п.5. 
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nounced and a record of the time each student in the 
control group spent studying forthese tests was ob- 
tained. A record of the time students in the exper- 
imental group spent learning from the teaching ma- 
chines was also kept for each student. 


The Findings 


The null hypothesis that there was no dif- 
ference in learning as a result of teaching 
machine instruction and the lec ture-discus- 
sion method of instruction could not be re- 
jected. i 
A comparison of scores on the first week quiz, 

however, revealed a significant difference in favor 
of the experimental group. Since the definition of 
the lecture-discussion method of instruction includ- 
ed studying from lecture notes in preparation for 
examinations, the first week quiz, for which no 
tudent studied, served as a control over the vari- 
ablesof lecture note study time. A Comparison of 
the test scores for the two groups appears in Table 
ш. 

The null hypothesis that there is no saving 
of instructional time as a result of the use 
of teaching machine instruction as compar- 
ed with the lecture-discussion teachnique was 
rejected. 

When instructional time was computed so as to 
include time that the control group spent instudying 
from lecture notes,the saving in favor of the ехрег- 
imental group was found to be 47 percent. When 
the computation of instructional time included only 
time spent in class or working with the machines, 
the saving in favor of the experimental group Was 
44 percent. Both percentages represent not only 
а significant but highly substantial saving of instruc- 
tional time in favor of machine instruction. The 
mean number of minutes of instructional time spent 
by the control and experimental groups during the 
course of the experiment is shown in Table IV. 

None of the three null hypotheses dealing 
with the comparative effectiveness of the re- 
sponse modes used could be rej ected, 

No significant difference could be found between 
the effectiveness of one response mode and any 
other. Table V shows a comparison of means and 
standard deviations for each ofthe response mode 
Sub-test difference scores. 


Summary and Conclusions 


This study tested five hypotheses relating to sev- 
eral aspects of the efficiency and effectiveness of 
teaching machine instruction at the college level. 
The findings of this study ingeneral tend to support 
the findings of several other studies previously re- 
ported, 

Within the limitations of this study (smallness of 
Sample size, the fact that only one lecturer was 
used and the possible impact of the experimental 
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situation) the following summary statements may 
be drawn from the data: 

1. Teaching machine instruction was found to 
be equally as effective an instructional technique as 
the lecture-discussion method for teaching the con- 
tent used in this experiment. Without the opportun- 
ity for out-of-class study of lecture notes, teaching 
machine instruction was foundto be significantly 
more effective than the lecture-discussion method. 

2. The teaching machine instructional technique 
was found to be considerably more efficient than 
the lecture-discussion instructional technique when 
efficiency was defined as the saving of instructional 
time. 

3. None of the three response modes used (con- 
structed, selected, or a combination of the two) was 
found to be more effective in producing learning 
than any other. 
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The Effect of Receiving the Confirming Response 


In Context in Programmed Material" 


IT WOULD BE consistent with Guthrie's (1952) 
position to assert that the more frequently a de- 
Sired response occurs in contiguity with relevant 
stimuli, the greater the probability of appropriate 
future responses. It is possible that programed 
materials have not taken full advantage of this 
principle. Traditionally, the correct response 
to a frame in а programed textbook appears as 
an isolated word, phrase or number on the page 
following the frame. Such an arrangement is 
economical of space, but it may not be the most 
efficient learning procedure inasmuch as it sep- 
arates in both time and place the response from 
its appropriate context. 

It is hypothesized that more learning will oc- 
cur when the confirming response appears in its 
appropriate context than when it appears in iso- 
lation. This hypothesis would provide an indirect 
test of what Estes (1960) calls the law of context: 
“The substance of this law is that the response 
reinforced in any learning situation becomes as- 
sociated impartially with all cues present, those 
in the *background' as well as those the experi- 
menter or educator wishes to make the occasion 
for the ‘correct’ response” (page 764). 

The experiment here reported was designed to 
compare a program in which appropriate cues 
were provided for every appearance of the correct 
response with a program in which the confirming 
response appeared in isolation. It was expected 
that students would gaia more understanding from 
the ‘‘response in context'' program because of the 
additional presentation of the cues associated with 
that response. 


JOHN D. KRUMBOLTZ 
Stanford University 


BARBARA BONAWITZ 
Michigan State University. 


Procedure 


The basic program consisted of 153 frames 
(preceded by 19 frames of directions), designed to 
teach prospective teachers how to write valid 
classroom achievement tests. Two forms of the 
program were prepared. Тһе ‘‘isolation’’ form 
presented the confirming response to each frame 
as a single word or phrase in the customary man- 
ner. The ‘‘context’’ form presented the confirm- 
ing response as a complete sentence in which the 
omitted word or phrase was underlined. The pro- 
grams were identical in all other respects. Table 
I contains an excerpt from both forms to illustrate 
the difference between them. 

The subjects for the experiment were 32 under- 
graduates in a course in introductory educational 
psychology at Michigan State University. Subjects 
were assigned at random to three treatmentgroups: 
the ‘‘context’’ group, the ‘‘isolation’’ group, and a 
control group who received a completely different 
program on interpreting test results. Randomiza- 
tion was accomplished by arranging the three types 
of programs іп а random sequence and distributing 
them to students as they were seated in the class- 
room. Students were instructed to check their 
responses carefully and to be prepared for a test: 
on the material the following day. 

On the following day a test of thirteen short- 
essay questions was given. The first eight ques- 
tions measured understanding of the technical 
terms taught in the program and were scored to 
yield a maximum possible score of 26 points. The 
last five items required application of knowledge 
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in decisions about the validity of sample test items 
and objectives and were scored to yield a maximum 
possible score of 16 points. The last item in this 
group asked the students to state some educational 
objective and compose a valid test item along with 
directions for scoring that item. Scoring the final 
item was a highly subjective task. While the inter- 
nal consistency of the scorer was probably high, a 
reliability check with a different scorer yielded a 
correlation of only . 43 for this single item. Since 
the scorer did not know which subjects had taken 
which program and since all subjects were graded 
according to the same standards, the subjectivity 
of scoring probably had no systematic effect on 
differences between the groups. 


Results and Discussion 


A summary of the results appears in Table II. 
Since only three comparisons of pairs of means 
were of interest, the t statistic was chosen to test 
the statistical significance of the differences. In 
each comparison the null hypothesis was tested 
against the alternative hypothesis that the popula- 
tion means were unequal with the region of rejec- 
tion set at .05. The assumption of homogeneity of 
variance was found to be tenable in each compari- 
son when the rejection region was set at the . 05 
level. 

No significant difference was Observed between 
the means of the ‘‘context’’ and **isolation'' groups 
on the first eight items which measured knowledge 
of the terminology taught by the program (t = . 13). 
However, on the five items measuring application 
of principles the difference between these two 
groups did reach statistical significance (t = 2.87). 
The group receiving the confirming answer “іп 
context’? scored higher on the application test 
questions (items 9-13) than the group receiving the 
confirming answer “іп isolation. ”” 

There can be little doubt that the basic program 
on writing valid test items had a desirable impact 
on those taking it regardless of the method of re- 
ceiving the confirming answer. The control group 
Scored far lower on the total criterion test than 
did either of the other groups (t - 6.95 for the 
comparison between control and **isolation" 
groups). 

The finding of most interest is that the confirm- 
ing answer ‘‘in context" seems to have some de- 
Sirable effect on the ability to respond correctly 
to application questions. Since the study was based 
оп a small sample with only one type of teaching 
material and a brief criterion test, this finding can 
only be suggestive of future research. Several 
questions are pertinent for any future research 
Work along these lines: 


(a) Did the group receiving the answer “ іп 
context” actually read the entire context? A later 


inquiry of the 13 ‘‘context’’ students revealed that 
only one student reported reading the whole re- 
sponse ‘‘all the time, ” ten read it **sometimes, ” 
and two just read the underlined words. The extent 
to which students actually attended to the context 
would be an important variable to control in any 
future experiment. 

(b) How much additional time is required for 
students receiving the confirming response “іп 
context"? In the present experiment students 
were asked to keep a record of the amount of time 
they spent actually working on the program. The 
‘context?’ group spent an average of 93 minutes, 
and the ‘‘isolation’’ group spent an average of 88 
minutes. This difference was not statistically 
significant since the pooled standard deviation was 
26 minutes. If a ‘‘context’’ program required an 
excessive amount of additional student time, many 
educators might prefer the simpler form. The 
time requirement does not appear to be excessive 
here, but future research would need to include 
the time variable. 

(c) Would students gain a better understanding 
if the ‘‘context’’ of the confirming answer were 
varied from the exact wording of the stimulus 
frame? Since the purpose of most teaching in- 
volves generalizing appropriate responses toa 
variety of equivalent stimuli, the generalizing pro- 
cess may be facilitated by placing the confirming 
answer in an equivalent, but not identical, context. 
Such an approach would be similar to the ‘‘conver- 
sational chaining” approach suggested by John 
Barlow (1960). 

(d) Would delayed measures of retention pro- 
duce a different picture? A study by Krumboltz 
and Weisman (1962) showed that a variable having 
no effect on immediate retention did affect a 
measure of retention delayed two weeks. Although 
no delayed retention measures were used in the 
present study, the importance of such measures 
should not be overlooked. 


Summary 


An experiment was conducted comparing two 
approaches of presenting the confirming response 
in a programed textbook designed to teach pro- 
spective teachers how to write valid classroom 
achievement tests. Тһе ‘‘isolation’’ approach 
consisted of presenting the desired response to 
the stimulus frame as a single word or phrase in 
the traditional programed manner. The ‘‘context’’ 
approach consisted of presenting the confirming 
response as a complete thought usually by insert- 
ing the desired response in a repetition of the 
relevant part of the stimulus frame. A control 
group received a completely different program. 
Thirty-two subjects were randomly assigned іо ће 
three treatment groups. 

Although the ‘‘context’’ groups did not exceed 


лотлецәд 
әтдЕллтәвдо 
10 вшләў ut 
po1e3s SIH 


лотлецәд 


ом 


әлептелә 45ә) 
eunseour 


рәләтцов z 
soAnoe(fqo т 


THE JOURNAL OF DOO RESEARCH 


‘OWI; snjnurrs әш SUIMOTIOJ e3ed ay} uo рәлеәййе 1oA4sue Зитшл1уиоо ә 


“101ЛЕЦӘД 
ӘТАЕЛЛӘ8(Ц0 JO SWI} UI paje}s SI 31 osneooq 
euo JSL Əy} ULY} 19339q st әлт}дә[до SUL 


c "91nje1ojr] 3ureroodde,, st oym juopnis 
ъ Jo лотлецәа ayy Ájroeds jou ѕәор 3I 


*poiejs Аүтеәто доп ST з ‘ON 


"eAn2efqo yey} 

әлпввәші о) WIJ 199] B jon1jsuoo 
0} зтпотуутр SI 31 “рәлтпрәл 1orAeueq 
ay} Ajtoads you ѕәор әлт}әә[до ue JI 


'seAnoefqo əy} gheaeryor 

sey juepnjs ay} ләшәцм әртоәр о) pue 
‘asanoo ƏY} jo .searnoefqo ay} оў 3ueAoqo1 
ST uotuA xoraedoq әуептелә о) ләцовә) ƏY} 
Smo “әлорәләці *uonenjrs )6әҙ poo3 y 


“(жжж) esneooq 

9uo SET ӘҢ ULY} 19319q st eAnoefqo SUL 

ce `SƏJI0}S JOYS Теләләв JO o[Ajs pue 

jxejuoo ay} әтейшоэ pue 'ssnostp *ojeis 0j 
әтав aq pmous 3uopnis әці,,, :asanoo цецЗия 
че jo eAnoefqo sty} тәртеиоо 45213002 UT 


.'"91nje&1ojyr[ Sureroaagdde,, st oym 
quepnis € јо. ey} Ajroeds you ѕәор 
3r әѕпеоәд mjdjeu Аләл jou st әлцэә[до SUL 


фучәшәләтцәе S,juopnjs * 4S9} 0j surojt 

рттел 101116002 оў Ásvo St 31 Jey} os pojvejs 
&үхеәтә eAnoe[qo stq SI ,,'o1njexejt[ poo 
әјетоәлайе оў Азтв ay} јо зиәшаотәләр,, :se 
МӘЛІЗ SEM әѕлпоо цетуЗия ue jo әлцоәѓдо uy 


"eAnoefqo zey} 0} шә 
359} € JONI}SUOD оў JMOTIJIp St 31 “‘рәлтпђәл 
ZoTARyaq ay} Ajtoads jou saop әлцоә[до ив л 


*Бәлт}дәӘ[дО eu sey juopnis 
eun leuqjeu^ әртәәр 0} pue 'esinoo ay} jo 
9u Jo т Əy} 0} jueAoqox 
ST YOTYM лотлъҷәа әўептелә оў 19g9€9] au? 

SMOTTe “әлоўәләцу *uomnenjrs 4S9} рооЗ у 


SOT 


РОТ 


£0T 


€0T 


ТУТ EU pen ee eee 5. 


«ЧОЦЕТОВІ,, «АахәшоО,, әшъля snus ‘ON 
ut reAsuy Зитшлтуо2 әшел 


*5ЧЯЛ5МУ ONIIHLIINOO AHL ONILNESHUd ЯО ЅИЧОЯ OML AHL ONLILVHISQ' TII 1аняэхя Wvuboud 
IG'ISV.L 


474 


KRUMBOLTZ - BONAWITZ 415 


TABLE II 


CRITERION TEST RESULTS FOR THE THREE TREATMENT GROUPS 


Mean of 8 Mean of 5 Total Total 
Group N “Knowledge” Items ** Application’’ Items Mean Variance 
€ ee oO 
“Context”? 13 12. 00 10.77 22.77 22. 69 
“Isolation”? 9 12. 23 8.44 20. 67 20. 00 
Control 10 $ * 9.00 7.56 


ж Not computed 


the ‘‘isolation’” group in knowledge of terminology i Machine Progr ESTIS 
on the criterion test, they did excel significantly eports, УШ, › 1990, pp. -199; 
on ability to apply principles of test construction. 2. Estes, William K. коше gi E W. 
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An Experiment Involving Use of English 2600, 


An Automated Instruction Text 


THE REAL impact of the teaching machine and 
the programmed textbook has just begunto make it- 
self felt. While a number of studies have been un- 
dertaken in the use of automated instructional ma- 
terials and methods of presentation in the past few 
years, educators have not yet foundthe ways this 
radically different approach can take to ease the 
teaching burdens of the modernconscientious teach- 
er. The intense interest whichthe automated meth- 
od has stirred in recent years has been shared by 
the Denver Public Schools in its effort to provide 
quality education for Denver youngsters. 


Description of Study and Conclusions 


One of the first programmedtextbooks published 
for schooluse is English 2600, written by Joseph 
Blumenthal and published by Harcourt, Brace, and 
Company, 1960. This work, which covers much of 
the content of English grammar, punctuation, and 
capitalization, contains 2632 items to which the 
learner must respond. It offers, theoretically, to 
the teacher of English a type of self-correcti ng 
homework which would relieve that teacher of the 
necessity of teaching, offering practice in, andtest- 
ing many mechanical aspects of the E nglish lan- 
guage. The time and energy saved by the teacher 
in having pupils learn grammaticalfacts abouttheir 
language can then be used in provocative discus- 
sions of the literature read by pupils andof the stu- 
dent writing which grows out of literary experien- 
ces as wellas personal experiences. This changed 
emphasis on the part of the teacher of English, 
however, can come about only when he is satisfied 
that the self-correcting homework helps the pupil 
to learn, as adequately as he should, the content of 
English grammer. Inthe autumn of 1960, the ma- 
terial of English 2600 was relatively untried. This 
study was undertaken with a dual pur pose: 1) to 
show the degree to which English grammer may be 
learned through the use of English 2600; and 2) to 
add to the storehouse of knowledge about the effec- 

tiveness of automated learning through trial use of 
English 2600. 

English 2600 has been revised since the study 
was completed, and the new edition seems superior 


JERRY E. REED, Denver Public Schools 


JOHN L. HAYMAN, Jr., Stanford University 


in several respects to that on which this report is 
based. Those concerned withthe Denver experience 
feel, however, that several general principles rel- 
ative to automated learning were suggested through 
use of the original text. This report presents these 
principles and describes the research methodology 
used to derive them. 


Problems Investigated 


The study of the use of the automated textbook, 
English 2600, was designed to yield answers to the 
following questions: 

1. Does English 2600, as a learning instrument, 
work equally well with pupils who have low academ- 
ic achievement as it does with those who have av- 
erage and high achievement? 

2. Is learning substantial for students who have 
worked through English 2600, compared to those 
with more traditional learning experiences? 

3. How long does it take for pupils of varying 
abilities to work through English 2600? 

4, Which parts of the book are especially diffi- 
cult? 


Design of the Study 


Five high schools participated in the study. In 
each school, two classes of tenth-grade English 
were experimental groups, using the textbook Eng- 
lish 2600. For each of these classes, a control 
group in the same high school was tested, but each 
pursued the regular course as planned by the teach- 
er. The study covered a period of three months, 
beginning October 31, 1960, and ending January 27, 
1961. 


Grouping 


Classes at differing tracks were used to deter- 
mine differences in accomplishment between able 
and limited pupils. The ten experimental classes 
and the ten control classes included groups in Eng- 
lish 3 (average ability), English 3M (low ability), 
and English 3X (high ability), as shown in the chart 
at the top of the next page. 
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Experimental Groups 


Control Groups 


School ЖЕ мыл не” 3M ЖЕСТ ТТТ НГ ЕНЕМ 

І Teacher А Теасһег А Teacher В Теасһег В 

П Teacher С Teacher С Teacher D Teacher С 
ш Teacher E Teacher E Teacher E Teacher E 
IV Teacher F Teacher F Teacher F Teacher G 
V Teacher H Teacher H TeacherI Teacher H 


The above chart indicates that two high ability 
groups, four average ability groups, and four low 
ability groups were used in the experiment. Also, 
it can be seen that five of the control groups were 
under the direction of the same teacher who hadthe 
experimental groups. Approximately 250 pupils 
were enrolled in the experimental groups. 


Use of Textbook 


Teachers of experimental groupsintroduce d 
their pupils to the method of self instructionthrough 
the programmed textbook, English 2600. Answer 
sheets were devised to facilitate the analysis of pu- 
pil responses and to yield several items of infor- 
mation for each individual: 

Record of responses, 

Number of errors made 

Time spent on each lesson 

Pupil opinion concerning difficulty of lesson 
Comments of pupil 


ао 


Data Collected 


To determine amount of learning by pupils in the 
experimental groups as contrasted by the control 
groups, two pre-tests and two post-tests were ad- 
ministered to all twenty classes. Опе of the two 
tests was a standardized test--the Language Sec- 
tion of the California Battery (Form W as pre-test, 
Form Y as post-test). The second test was admin- 
istered as a mimeographed test, but it was actually 
the final test in the booklet of mastery tests which 
accompanies the textbook, English 2600. These 
tests formed the criterion variabls for the study. 

Other measurements included, for each individ- 
ual, the academic aptitude (IQ), the academic rating 
in English (grade point average for Grade9in Eng- 
lish), the academic rating of other subjects, and 
test scores from three sections of the Iowa Tests 
of Educational Development: Correctness in Writ- 
n Reading in Literature, and General Voc abu- 

ary. 


Treatment of the Data 


The first part of the analysis involved compari- 
sons between test scores of the groups of pupils on 
two separate measures. One comparison involved 
test means for the three experimental groups to de- 
termine if the better pupils learn more from Eng- 
lish 2600 than poorer pupils. Asecond comparison 
"was made between test means ofthe total experi- 
mental group and the total control group to deter- 
mine whether pupils who took English 2600 learned 
more than those who had regular instruction. To 
determine whether results were consistent for all 
pupils, a final comparison in this portion of the an- 
alysis was between the test mean for each experi- 
mental group and that of its related control group. 
These comparisons were made for both measures 
—the standardized test and the English 2600 test. 

Relatively large differences between experimen- 
tal and control groups on the related measures in- 
dicated that equivalence inlearning ability could not 
be assumed. Therefore, covariance analysis was 
chosen asthe appropriate statistical technique to 
assure a valid reflection of the teaching m ethods 
used. 

The second phase of the analysis involved ехрег- 
imental pupils only and was concerned with error 
rates, time rates, and perceived difficulty of les- 
sons. From the answer sheets of the pupils, time 
rate for individual lessons and error rate for indi- 
vidual frames were computed. Comparisons were 
made between time rates and error rates for the 
various ability groups. Correlations were comput- 
ed within groups and between groups on difficulty 
rating, error rate, and time rate. 


Conclusions; 


The study sought to answer four questions, two 
comparing the method of English 2600— that is, 
automated instruction—with more traditionalteach- 
ing methods, and two concerning time and difficul- 
ty of English 2600. Statistical summaries leading 
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to the following conclusions are given in the statis- 
tical data part of this report. 

1. Does English 2600, asalearning instrument, 
work equally well with pupils who have low academ- 
ic achievement as it does with those who have aver- 
age and high achievement? 

At the tenth-grade level, English 2600 proved to 
be more effective with students who have high 
achievement. On both the standardized andthe non- 
Standardized measures employed in the study, the 
more able pupils using English 2600 scored higher 
than those who did not use English 2600. Also on 
both measures, the low achievement pupils who did 
not use English 2600 scored higher than those who 
did. No significant difference was found between 
average achievement pupils using English 2600 and 
similar pupils learning by other methods. 

2. Is learning substantial for students who have 
worked through English 2600, compared to those 
with more traditional learning experiences? 

The scores on the two measures employed in this 
study indicate that learning was substantial for all 

Students in both experimental and control classes. 
The amount of gain in test scores was about the 
Same for the total groups, including all three levels 
of ability. It cannot be concluded that the automat- 
ed method as utilized in English 2600is a superior 
method for all learners. 

3. How long does it take for pupils of varying 
abilities to work through English 2600? 

The study indicated that, as expected, more 
able pupils work through English 2600 more rapid- 
ly than do average ability pupils, who in turn work 
much more rapidly than do low achievers. On the 
average, able tenth graders finished English 2600 
in 9.68 hours, average pupils took 11.30 hours, 
and low achievers required 14,23 hours. 

iB Which parts of the book are especially diffi- 
cu 

The frames which presented the most difficulty 
to the most pupils were examined carefully. An 
example of a confusing frame is No. 21, in which 
pupils are asked to respond with the word ‘‘predi- 
cate. ’’ Many tenth grades respond instinctively 
with the word ‘verb, ” which isn’t introduced in 
English 2600 until Frame No. 92. Similarly, in 
Frames No, 440, 441, 442, pupils are asked to 
Supply suitable helping verbs, but only one or two 
“correct” responses are given; when pupils know 
instinctively that their answers are correct, but do 
not get reinforcement from the printed answer, 
they are confused and many are angry. 


Opinions of Participating Teachers 


In addition to the foregoing answers to the ques- 
tions of the Study, the teachers using English 2600 
in their classes expressed their opinions about the 
implications of the method as follows: 

1. Despite extreme efforts to introduce the 


method of study to the pupils adequately, some pu- 
pils were found to cheat, particularly among the low 
achiever groups. Some of the comments of pupils 
bore out their misunderstandings of the method; a 
comment like “I hate these tests! " shows that 
some students were not convinced that English 2600 
is a new learning method rather than a test. Two 
of five teachers felt that cheating was no problem; 
they were unaware of any cheating on the part of 
their pupils. 

2. Allteachers agreed that the materials in 
English 2600 are not equally effective with pupils of 
varying ability and achievement levels. Most felt 
the method better suited to average or above-aver- 
age ability groups, while one felt the materials in 
English 2600 were better suited to the low achiever 
in Grade Ten. This latter comment is made in light 
of the fact that much of the content of English 2600 
is review material for most pupils of Grade Ten. 
This general feeling is supported by the statistics 
of this study. 

3. Most teachers expressed the feeling that most 
youngsters enjoyed the automated method of learn- 
ing initially, but several pointed out that, when the 
novelty of the situation wore Off, many pupils suf- 
fered from boredom. These comments have impli- 
cations for the specific ways automated materials 
might be used in the average classroom. 

4. Most of the teachers involved felt that auto- 
mated materials should be employed as a portion of 
the instructional materials in English, In general, 
they felt that the materials, for the present, should 
be supplementary and used to provide variety in 
classroom procedures. They indicated, too, that 
the materials should be used in close correlation 
with other work in process in the classroom. 

5. Among the advantages listed by teachers of 
the automated method are immediate reinforcement, 
meeting of individual needs, progress at individual 
Speeds, increased teacher time available for indi- 
vidual problems, and the novelty of the method. A- 
mong the disadvantages they cited were cheating, 
lack of motivation and desire to make full use of the 
materials, and boredom. In addition, teachers 
Specifically cautioned against using programs which 
are remote from the classwork of the group, they 
Suggested that programs differing in approach and 
difficulty be available for below-average groups, 
and one felt strongly that teachers and pupils must 
avoid the assumption that completing a program 
would guarantee learning. 


Statistical Analysis 
SS ena ysis 


Of the four questions whicht he study sought to 
answer, the first two involve comparisons of ex- 
perimental and control groups in terms of amount 
learned, while the second two are concerned only 
with experimental students and their performance 
on English 2600. 
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English 2600 vs. Regular Instruction 


To determine the appropriate analysis technique, 
groups were first compared on all variables for 
which data were collected. Relatively large differ- 
ences between experimental and control groups indi- 
cated that equivalence in learning ability could not 
be assumed. Therefore, covariance analysis was 
chosen to assure a valid reflection of the teaching 
methods used. 

IQ and English grade-point average were chosen 
as control variables because they correlated sub- 
stantially with the criterion variables (г - approxi- 
mately . 700 in each case), and they measure char- 
acteristics on which students logically should be 
matched. IQ indicates the student's learning poten- 
tial and grades show the teachers' rating of his 
classroom performance. 

Matching on actual achievementin English also 
seemed desirable, but the Iowa tests presented 
special problems because data were incomplete for 
many subjects. Careful examination revealed that, 
of these, Correctness in Writing (Iowa 3) measured 
achievement in English grammar most directly. 
Therefore, this test was chosen as a control vari- 
able, and the Reading in Literature and General Vo- 
cabulary tests were eliminatedfrom further consid- 
eration. 

Group means on the three control variables are 
given in Table I. These control variables indicate 
that pupils in regular instruction groups were su- 
perior in learning ability, and appropriate adjust- 
ments in criterion test results were in order. Ta- 
ble I also shows striking differences among the 3X, 
3, and 3M pupils. 

Table II gives pre-test and post-test scores on 
both criterion tests. Alldifferences in Table II are 
significant with P<. 02 for the smallest difference 
on the English 2600 test and P<. 001 for all differ- 
ences on the California test. All pupils, then, both 
experimental and control, performed significantly 
better on the post-tests. Everybody learned a sig- 
nificant amount during the study. More important, 
however, are differences in learning between ex- 
perimental and control groups and the consistency 
of these differences among those of varying ability. 

These differences were considered by means of 
two separate covariance analyses. Inthe first, the 
2600 post-test was the dependent variable, and in 
the second, the California post-test was used. Ap- 
propriate pre-test scores were control variables 
in both cases. This method of handling pre- and 
post-tests is recommended by McNemar (McNemar, 
Psychological Statistics, pp. 355-56), and is more 
accurate than using gain scores as the dependent 
variable. In addition, it avoided the ceiling effect 
which could result from the high performance of 
3X students on both pre-tests. Significance tests 
are given in Tables IV and V. 

Tables IV and V show no significant difference 
in performance between experimental and control 


subjects, and both show a significant difference a- 
mong 3X, 3, ЗМ students. In addition, a signifi- 
cant interaction occurred in the analysis of the 2600 
test. This interaction calls for careful examination 
of group means. Means, adjusted by the covariance 
technique, are given in Table VI. 

Results of both tests indicate that the 3X students 
learned more from automated instruction (English 
2600), while the 3M students learned more from 
traditional instruction, and the significant interac- 
tion in Table IV suggests that this effect was real. 

To summarize, learning was substantial for all 
students, and, overall, those whoworked with Eng- 
lish 2600 learned about the same amount as those 
with the more traditional learning experiences. The 
results indicate, however, that English 2600 was 
more effective with high achievement students than 
it was with the low achievers. 


Performance on Automated Instruction Materials 


The second part of the analysis involved experi- 
mental students only and was concerned with error 
rates, time rates, and perceived difficulty of les- 
sons, Table VII gives these data by English 2600 
unit and by 3X, 3, and 3M groups. 

Difficulty rating is derived from student ratings 
of each lesson on a four-category scale: 1) this les- 
son is very easy, 2) this lesson is rather easy, 3) 
this lesson is rather difficult, and 4) this lesson is 


. very difficult. 


The results in Table VII are consistent across 
groups; i.e., 3X students thought each unit was 
easier than did the other students, they made few- 
er errors, and they took less time, while Group 3 
students were second in each case. 

Differences between the groups ondifficulty rat- 
ing, error rate, and time rate for the whole book 
are tested for significance in Table VIII, The sig- 
nificance of all differences beyond the . 05 level 
suggests that real variations occurred among stu- 
dents of differing abilities in performance on the 
English 2600 book. 

The groups did not agree on the relative difficul- 
ty of the units. Rank correlations (rhos) in Table 
IX show, however, that the relationship between 
perceived difficulty was greater than that between 
error rate; the average between-groups rho on dif- 
ficulty rating was .510, while that for error rate 
was . 334. 

Table IX also suggests some other conclusions, 
For example, 3X pupils seem to be more similar 
to 3 pupils than 3 pupils are to 3M pupils, while 
3X's and 3M’s are substantially different. The av- 
erage rho between 3X’s and 3’s on all variables 
was .780, between 3’s and 3M's маз. 545, and be- 
tween 3X’s and 3M’s only .240. This result and 
the interaction discussed earlier raise Ser ious 
questions as to use of the same automated instruc- 
tional materials with students of widely differing 
academic abilities. 
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Average between-groups rhos of . 721 on time 
rate, .510 on difficulty rating, and . 344 on error 
rate suggest that all pupils were most consistent 
on amount of time spent on each frame and least 
consistent on error rate. Average within-groups 
rhos show that error rate and time rate were most 
closely related (. 489), error rate and difficulty 
rating next (. 308), and time rate and difficulty rat- 
ing least related (. 155). Thus, pupils made the 
most errors on frames which took the most time, 
but they did not perceive these frames as most dif- 
ficult. 

A more meaningful way to look at time rate, 
perhaps, is in terms of frames per minute and 
frames per hour. Table X presents time rate in 
this fashion. 

Group 3X pupils averaged 87 frames an hour 
more than 3M pupils and 40 more than 3 pupils; i. 
e. , the 3X's worked about 45 percent faster than 
the 3M's and 17 percent faster than the 3's. At 
this rate, the 3X pupils would complete English 
2600's 2632 frames in about 9. 68 hours, while 3 
pupils would need 11. 30 hours and ЗМ pupils would 


need 14. 23 hours. 

Question No. 4 in the study dealt with difficult 
portions of English 2600. An analysis of comments 
written on the answer sheets shows that pupils in 
the 3X groups were concerned with frames which 
were incorrect or confusing (about 70 percent of 
their comments were in this category). Pupils in 
the 3 groups commented primarily on incorrect or 
confusing frames (33 percent) and on the simplici- 
ty of this learning method (36 percent). Pupils in 
3M groups commented about confusing frames (28 
percent), that they liked the method (18 percent), 
and that they found parts of English 2600 either 
easy or difficult (33 percent). A summary of the 
comments on English 2600 by group and by percent 
of comments in each category is given in Table XI. 


A careful tabulation was kept as pupils turned in 
answer sheets in an effort to discover whichframes 
were missed by most pupils. Two hundred and 
twenty-one frames, 8. 4 percent of the total book, 
had error rates of ten percent or more, while 21 
frames had an error rate of 20 percent or more. 


TABLE I 


GROUP MEANS ON THE CONTROL VARIABLES 


=————Є—Є———Є—Є———Є—Єї]ү———ЄүЄ 


English Classification Variable Method of Study 
2600 Regular Total 
3X Iowa 3 87.343 86. 333 86. 775 
19 116.528 116.878 116.726 
English ОРА 3.594 3. 400 3.485 
3 Iowa 3 63.380 69. 408 69.394 
IQ 102.514 106.260 104. 388 
English GPA 2.327 2. 493 2.410 
3M Iowa 3 32. 020 26.979 29.551 
IQ 89. 020 87.126 88. 092 
English GPA 1.330 1.188 1.260 
Total Iowa 3 58. 705 61.634 60.206 
IQ 101.334 103.574 102. 462 
English GPA 2.292 2.360 2.327 


————M———————M————MMMÁÉÓÉÁEo 
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TABLE II 


GROUP MEANS ON THE 2600 PRE- AND POST- TESTS AND 
ON THE CALIFORNIA PRE- AND POST- TESTS 


Ф 
English Classification 2600 Test-- Method of Study California Test--Method of Study 
2600 Regular Total 2600 Regular Total 
3 Pre-test 74. 657 78.578 76. 863 83.629 84. 913 84. 358 
X 
Post-test 82. 886 82. 044 82.413 97.200 94. 348 95. 580 
Pre-test 64. 507 67.296 65. 901 58. 870 65. 015 61. 897 
3 
Post-test 68. 324 71. 761 70, 042 71.203 76.299 73.113 
Pre-test 46. 700 47. 479 47. 082 25.526 21.380 23. 170 
3M 
Post-test 48. 840 49. 042 48. 939 43.131 31.560 39. 795 
Pre-test 61.077 64. 591 62.878 56.049 57.245 56. 689 
Total А 
Post-test |  65.346 67. 933 66.672 69. 993 69. 509 69. 734 
TABLE Ш 


FREQUENCIES AND STANDARD DEVIA TIONS FOR PRE- AND POST- TESTS 


English Classification 2600 Test--Method of Study California Test--Method of Study 
2600 Regular 2600 Regular 
3X N 35 45 35 46 
o Pre 7.107 9.187 6.547 13.349 
с Post 8. 606 8.691 4.111 13.878 
3 N 71 71 69 67 
с Pre 13. 046 12.999 25.116 21.184 
с Post 13. 849 13.799 23.306 24,579 
3M N 50 48 38 50 
o Pre 9, 124 8.270 26. 992 27.000 
о Post 8.624 9.841 25.309 23.508 
TABLE IV 
COVARIANCE ANALYSIS: 2600 FINAL AS DEPENDENT VARIABLE 
= 
шіні gee Degrees of Suc Squres Mean Square dome E z^ anon 
Method of Study 1 3.068 3. 068 351 > .50 
English Classification 2 632. 424 316.212 6. 467 < .001 
Interaction 2 364. 171 182. 086 3.724 < .05 
Within Groups* 310 15, 158. 852 = 48. 899 


N = 320 


wean ac orror term. 
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TABLE V 


COVARIANCE ANALYSIS: CALIFORNIA ACHIEVEMENT TEST AS DEPENDENT VARIABLE 


Source of Variation Residual * 
Degrees of Sum of Squares Mean Square Frequency Probability 

Freedom Ratio 
Method of Study 1. 40.441 40.441 <1 >.50 
English Classification 2 1, 890. 906 945. 453 3.338 <.05 
Interaction 2 185. 177 92. 389 <1 > .50 
Within Groups* 295 83,559. 626 283.253 
ж Within groups mean square used as error term. N = 305 

TABLE VI 
ADJUSTED GROUP MEANS $ 


English Classification 2600 Test--Method of Study California Test--Method of Study 
2600 Regular Total 2600 Regular Total 
3X 70. 683 68. 389 69. 259 76.261 74. 460 75.234 
3 67.197 67.895 67.575 70.834 69.914 70. 231 
3M 61.571 64 869 63.243 62. 668 64. 850 63. 905 
Total 66. 686 66. 655 66. 672 70. 054 69. 444 69. 734 
¢ 
TABLE VII 


COMPARISON OF GROUP MEANS ON DIFFICULTY RATING, 
ERROR RATE, AND TIME RATE 


Unit Әйа tating c Error Rate Time Rate? 
M 

1 149 17) 203 019  .039  .071 12186 —.329 1412 
2010 197 2.09 017 .037 062 22270 29270 .340 
з 188 193 2.03 015  .036 .059 .235  .267  .317 
4 167 18 19 023  .046  .0T -217 263  .31T 
5 L8] L8 L8 .028 038 .059 .2313 — .247 1324 
6 152 166 1.18 014  .032  .060 12020 .242 1317 
7 L6 16 1.80 015  .027 .050 .194  .233 .324 
8 LA 19 2.10 012  .035 .080 1167 .212 .309 
9 17! 18 2.02 029 .037 .046 .211  .257 1315 

10  L74 189 2.08 2008  .051 .104 .239  .277  .326 

11 177 188 2.13 .019  .038  .044 .229 .245 263 

Total 

Book 1.70 182 1,98 .020 .038 .065 


.220 .258 .324 


1 Rate given in errors per frame 


2 Rate given in minutes per frame 


“- 


шр чаи на бег 
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TABLE VIII 


SIGNIFICANCE OF DIFFERENCE BETWEEN GROUP MEANS ON 
DIFFICULTY RATING, ERROR RATE, AND TIME RATE 


5 
Du t 


ж Variable ^ Groups Compared Difference 
Difficulty 3- 3X .118 
Rating 

3M-3 .162 

3M - 3X .280 
Error 3-3х .018 
Rate 

3M-3 .027 

3M - 3X . 045 
Time 3-3X .038 
Rate 

3M- 3 .066 

3M - 3X - 104 


.050 2:360 
.051 3.170 
.053 5.250 


.0057 4.165 
.0060 7,500 


< 

< 

< 
.0031 5.186 <.001 

< 

< 


.013 2.850 «.02 


.014 4.570 <.01 


.014 7.200 «.001 


зр (probability) figures are for t with 10 degrees of freedom. 


* 


RANK CORRELATIONS 
DIFFICULTY RATING, 


TABLE IX 


WITHIN AND BETWEEN GROUPS ON 


ERROR RATE, AND TIME RATE 


Difficulty Rating Error Rate Time Rate 
Difficulty 3X „662 .300 .364 .082 
Rating 
.568 

3M 
Error 3X 
Rate e 

3^ 

3M 
Time — 3X .945 .530 
Rate 

3 . 689 

3M 


Between-group rank correlations are to the left of the diagonal line; 


are to the right of the line. 


TABLE X 
‘OR GROUPS IN TERMS OF FRAMES PER 


TIME RATE Fí 
MINUTE AND FRAMES PER HOUR 
Unit Frames per Hour Frames per Minute 
x М Т A EE S 
1 215.82 182.40 145, 62 3.597 3.040 2.427 
2 253.14 222.24 176, 46 4.219 3.104 2,941 
3 255.30 224.70 189.24 4.255 3. 145 3,145 
4 276.48 228.12 189.24 4. 608 3. 802 3.154 
5 281.70 242,94 185.16 4. 695 4, 049 3.086 
6 297.06 247.92 189.24 4.951 4.132 3.154 
" 309.30 257.52 185.16 5.155 4.292 3.086 
8 359.28 283.02 194.16 5. 988 4.717 3.236 
9 284.34 233.46 190.50 4.739 3.891 3.175 
10 251.04 216.60 184.02 4.184 3.610 3.067 
11 262.02 244.92 228. 12 4.367 4.082 3.802 
pens 272.70 232.56 185. 16 4.545 3.876 3.086 


within-group correlations 
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TABLE XI 


SUMMARY OF COMMENTS ON ENGLISH 2600 BY GROUP AND 
PERCENT OF COMMENTS IN EACH CATEGORY 


TM—————M————————————————————— 


Comment . EMT cMGropse ee t O 
3X 3 3M 
Disagreement with author about ananswer 33. 54% 12.70% 8. 86% 
Like this method of studying English 3.80 5.29 17.72 
One or more frames are confusing 34.18 20. 10 27.85 
» Like the method because it is easy 0. 63 1. 06 1. 26 
Dislike the method because it is |, 
{оо еазу 6.33 Р "i. 64 2.53 
Simple statement that method is easy 1.27 23.28 10.13 
Total who commented that method is easy 8.23 35.98 13.92 
Like the method because it is difficult 0. 00 0. 00 2.53 
Dislike the method because it is too 
difficult 1.90 2.64 11.39 
Simple statement that method is ү 
- difficult 3.80 5.82 5.07 
Total who commented that method is 
difficult 5.70 8.46 18.99 
Explanation by pupil of an error he 
made 5.06 2.65 3.80 
The book could be improved if it were 
rearranged in some way 3.80 1.59 0.00 
Other 5.70 13.23 8. 86 
100. 00% 100. 00% 100. 00% 
Number of comments made 158 189 79 
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Introdüction 


EXPERIMENTAL design—its structure, appli- 
cation, and utility in the interpretation of data— 
should provide the basic criteria for evaluating 
principles of programmed instruction. The theory 
of programmed instruction should not be used for 
evaluating; its function is to integrate, relate, and 
make meaningful the final outcomes of empirical 
research within the broadest possible context. Up 
to the present time the broadest possible context 
within which present programmed learning theory 
has proved workable is that of the experimental lab- 
oratory. 

Skinnerian theory has dealt adequately and par- 
simoniously with learning behavior under laboratory 
conditions (11). Stimulated by Skinner’s success, 
researchers from many fields have endeavored to 
substantiate and to utilize Skinnerian principles in 
many different contexts. One result of these efforts 
has been the gradual accumulation of evidence indi- 
cating that the laboratory theory is not clearly sub- 
stantiated in school settings (10). С ontradictory 
findings surround the issues of reinforcement and 
mode of response. Further, programs do not have 
to be organized into a progressive sequence as 
present theories would predict. The scientific key 
to program construction is not yet forthcoming. As 
professional educators, how are we to deal with 
these controversies, ambiguities and proclama- 
tions? Are we going to be flooded with a wave 
of myth and folklore about programmed instruc- 
tion, or do we have some objective procedures and 
criteria by which we can proceed reasonably and 
advantageously ? 

Our intention in this paper (as in the research 
reported) is to suggest some of the objective pro- 
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cedures and criteria which can be employed so that 
programmed instruction can be evaluated with 
reason, and its advantages to educational practice 
studied objectively. Following a discussion of the 
literature relevant to our experiment, a statement 
of the rationale and philosophy framing our re- 
search will set the stage for a more detailed ac- 
count of experimental procedure, analysis, and in- 
terpretation. 


Background and Literature 


The design of the experiment* incorporated 
two factors: levels of intelligence and mode of 
response. The relations and interactions among 
these factors and their various levels (high, middle, 
and low intelligence levels, overt and covert re- 
sponse modes) have been stated or implied by lab- 
oratory theory of programmed instruction. Contra- 
dictory evidence surrounds their investigation in 
school settings. In his recent review, Silberman 
(10:187) states that *t Beyond demonstrating that a 
carefully written set of materials willteachif а stu- 
dent will spend enough time studying them, we have 
little unequivocal evidence for principles of рго- 
grammed instruction". 

Accepted notions concerning the effect of level 
of intelligence and mode of response have received 
new impetus from programmed learning. These 
notions are that instruction will be improved if the 
effect of intelligence is reduced, and if the learner 
isactive. It appeared a year or So ago that pro- 
grammed instruction provided the means by which 
such improved instruction could be obtained. This 
was justifiable optimism, for it appeared that pro- 
gramming set learning conditions which “washed 
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out?’ the effect of intelligence and ‘‘stamped іп” 
knowledge by overtly involving the learner. 


Evidence supporting the ‘‘wash ош?” effect of 
programming on intelligence is meager. Little (5) 
found that students with lower ability performed 
better than expected when aided by an automated 
teaching device. No significantrelationshipbetween 
intelligence and achievement during a year-long ex- 
periment with second- and sixth-grade pupils was 
found by Porter (8), while Ferster and Sapon (2, 6) 
found that traditional measures of aptitude for 
learning German were less effective in predicting 
achievement among a group of students studying 
from programmed material, 


However, Evans, Glaser and Homme (see ref. 
1, and pp. 429-449 in this issue) found that the ef- 
fect of intelligence was one of the major factors 
producing significant differences in their study. 

The issue of response mode is fundamental to 
generallearning theory. In its application to pro- 
gramming, the involvement of the learner by re- 
quiring overt responses continues to remain widely 
used in preparation of the programs. Fry (3) found 
а constructed (highly overt) response mode to be 
significantly better than a multiple-choice training 
(less overt) response mode for both immediate and 
delayed post-tests under all conditions. A number 
of studies, reviewed by Silberman (10), report no 
significant difference between overt and covert 
modes. The only clear conclusion as reported by 
Stolurow (12) and Silberman is that the covert re- 
sponse mode increases the efficiency of learning in 
terms of time. Krumboltz and Weisman (4) have 
recently attempted to resolve this issue in terms 
of delayed retention. They found no significant dif- 
ferences between overt and covert modes on an im- 
mediate post-test but significant differences on a 
delayed retention test in favor of the overt response 
mode. 


In general, studies cited in this section evi- 
dence a number of inadequacies, particularly those 
of a methodological character. Most findings are 
based on programs of no more than 100 frames and 
consequently of short duration. Many ofthe reports 
omit data necessary for any Systematic evaluation 
and integration of findings. In the next section we 
will outline an approach to these problems and the 
requirements in terms of research design. 


A Philosophical Perspective 


From the viewpoint of schoolsettings, research 
on programmed instruction has accomplished two 
objectives. First, the method of instruction can 
now be adequately described and defined. Second, 
the processes implied by description and definition 
have been operationalized (à la Bridgman). The 


next step is to systematically study characteristics 
and outcomes of learning occurring under condi- 
tions of programmed instruction. 

This perspective is based on a number of con- 
siderations. One major consideration is the inade- 
quacy of experimental and analytical procedures 
utilized in reported research. For example, no 
study known to the authors has sampled the school 
population adequately. If the aim of programmed 
instruction is improvement of learning within the 
school setting, then definitive evidence needs to be 
forthcoming before schools expend money and ef- 
fort on the introduction of programming into the in- 
structional repertoire. The acceptance of pro- 
grammed instruction on the basis of “experiential” 
research can only lead to further confusion in cur- 
riculum practices. 

Three questions are implied if the intention is 
to improve learning within the school setting. What 
constitutes improved learning? Which students can 
improve by this form of instruction? What are the 
conditions under which improvement occurs? 

These questions guided the study, The article 
was based on the issues implied by the second and 
third questions, and an answer to the first is at- 
tempted by an analysis and interpretation of the re- 
sults presented, 


Experimental Design in Perspective 
аец Pe CULVE 


The questions stated in the previous sectionin- 
itially imply two requirements for the experimental 
design: first, an adequate sampling of a school 
population; and second, establishment of treatment 
conditions which could be expected to maximize dif- 
ferences among treatment groups by the end of the 
experimental period. These two requirements im- 
ply a third—that if differences are to be detected, a 
sufficient number of subjects needs to be placed in 
each of the experimental groups. This third re- 
quirement is particularly important in relation to 
the order of differences which are to be detected. 
If each experimental group is composed of five or 
ten subjects, only relatively large differences can 
be detected. By increasing the number of subjects 
in each group, it is possible to detect smaller dif- 
ferences. 

Discussion in the previous paragraph intro- 
duces a problem given little attention in educational 
research: What order of differences do we wish to 
detect? The answer to this question is based on 
answering the question raised earlier; What con- 
stitutes improved learning? Specifically, do we 
wish to raise the mean score of all pupils by one 
scale point, or are we interested only in instruc- 
tional methods which increase learning by five or 
ten scale points? 

The application of an experimental design al- 
Ways indicates the researcher’s explicit or implicit 
judgment or misjudgment on these questions. If the 
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effects of programmed learning are to be more de- 
finitive and comparability of findings is to be at- 
tained, more attention needs to begiventothe power 
of the tests of significance appropriate to the ex- 
perimental design. The concept of power refers to 
the likelihood that differences will be detected when 
differences actually exist; or conversely, the like- 
lihood of not detecting differences when differences 
do not exist. Statistically, this means it is desir- 
able to reduce the probabilities associated with 
Type II error, i.e., that of accepting the null hy- 
pothesis when it is false. Usually experimenters 
in programmed learning have been concerned only 
with Type I error, that of rejecting the null hypoth- 
esis when it is true, by setting the significance lev- 
el at . 05 ог. 01. 

A mathematical index of power is given by: 


g- (Ж) zA (Ref. 9:64) 


0 


where J is the number of observations or subjects 
in each experimental group, I the number of exper- 
imental groups, A the size of the difference to be 
detected, and c the standard deviation of the obser- 
vations. This implies, for example, that the power 
of the significance test is increased if 1)J, num- 
ber of observations, is increased, or 2)5is in- 
creased. Тһе power is decreased if 1) І, number 
of experimental groups, is increased, 2) c is in- 
creased, 

This formulation can be exemplified by refer- 
ence to our experiment where in the case of the 
final test results 02 = 41.3569 andg = 6.4310. In 
the most extreme case where we wish to detect a 
difference of one scale point, we might have the ef- 
fects: 

Bi -...-Ві-2 = 0, Ві-і --% 81-2» and В. = 0, 
where, again, Iis the number of experimental 
groups, the В; are the effects corresponding to each 
experimental group, and В. is the m ean of those 
effects, which is constrained to equal zero under 
our model. The fj above are chosen in order to 
achieve the difference most difficult to detect. The 
following derivation will show the power under these 
extreme conditions. 

The above relations imply that 


үле 2 
во cV = JEG, - B =1[0+...+0+@+@ 
i 


2 
= zs , which implies that 


ЖАП footnotes will be found at end of article. 


ы- JA 


JM А 
202”? and therefore 6 = ) DA 


Now since 9 = 6(v, + i 


where y, is the number of df for groups 
1 1 1 1 
-2 _(J\2A,72 Ј\2 д 
=втл ASAA жо (са ecu 
"ur (2 gl (à 9 
(Ref. 9:63) 
Specifically, could we in our research have detect- 
ed a difference of one scale point between the overt 
and covert response mode groups ? 
Let Y, 21, v4 = 5282 о, с -.05, І = 2, and 
552 


3-99. 276. 
H 
206]? 1 nan 
Theref: Ee ааа Secun ДЕЛЕ 38; 
poppe [25] 6. 4310 (69 (e wi 
ато 
= 6.4810 21-79 


This value of Q, 1.29, along with the above 
sighted a yields (refs. 9:438) the probability of 
Type II error (say Y) із. 45 and implies that the 
power, 1 - Y, is .55, Thus the chances are that 45% 
of the time we could accept the null hypothesis, of 
equality of effects, when the deviation from this hy- 
pothesis is greater than one scale point. One ap- 
proach to correcting this uncertainty is to increase 
the number of students needed in each treatment if 
we wished to detect a difference of one. 

If we set 1 - (the power) = 0.95, and a = 
0.05, then Ø = 2.55, where Y, = 1 and Уз ~ 0; 


9% а-а =(2)* [53 and 
29. e ur DX ian Al ni 
(2) ар 


therefore (a? = 2(2. 55)(6. 4310), and = 4(2. 55)? 
(6. 4310)? = 992, 
whereas before, J = 276 . 

Thus, under conditions similar to our present 
experiment (i. e., 0 = 6.4310), theoretically we would 
beableto detect a difference of one scale point withan 
associated probability of . 95, if J = 992. But the 
error per observation in a hard-to-controlexperi- 
ment of this size would probably increase and thus 
raise the overall error variance, 02, hence making 
the required J appreciably greater than 992. 


Design of the Experiment!* 


The design of the herein reported experiment 
was a fixed effects 2 x 3 factorial in which there 
were two response modes and three levels of intel- 
ligence. 4 No directional hypotheses were formu- 
lated prior to the experiment. (Ref. footnote p. 485.) 

This approach was taken primarily for two 
reasons. First, the many contradictory reports to 
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be found in the literature on programmed instruc- 
tion indicated exploratory studies were needed that 
adequately sampled the student population and so 
designed that maximum differences between treat- 
ments could be attained. Second, within each ex- 
perimental group the number of students should be 
large enough so as to fix the power of the test of 
Significance at an appropriate level, 

Another method of increasing power would be 
that of using psychometric techniques in test con- 
Struction, to lower the test variability. 


Subjects 


АП students enrolled in the Wauwatosa ninth- 
grade mathematics program were used as subjects 
for the experiment. The range of IQ’s (from the Hen- 
mon-Nelson Tests of Mental Ability) was from 76 to 
162, with а mean IQ of 114 These subjects 
were randomized into experimental groups by a 
Stratified sampling procedure (7) within each of the 
three levels of intelligence. This was done in ог- 
der to prevent a biased distribution of intelligence 
Scores among the corresponding experimental 
groups. The range of IQ's for each of the levels of 
intelligence was: high intelligence group 120-162, 
average intelligence group 112-119, and low intel- 
ligence group 76-111. Within each of these tertiles 
subjects were randomly assigned to each of the 
corresponding experimental treatment groups as 
defined by the two levels of mode of response. 


Programmed Material 


The subject matter used was a programmed 
unit in mathematics entitled ‘Sets, Relations, and 
Functions,"2 arranged in booklets with a total 
of 843 frames. The two modes of responses were 
defined in terms of the subjects’ responding overtly 
or covertly. Subjects responding overtly wrote the 
answer to each frame in a book provided; those re- 
sponding covertly were not provided with answer 
booklets and were directed not to write down their 
answers, 


Experimental Procedure3 


The experimental procedure was restrictedto 
the normal class periods devoted to mathematics 
over two weeks; it was administered throughout by 
the teachers who were normally responsible for 
teaching the respective classes. Within eachclass- 
room students proceeded under the various experi- 
mental conditions. Teachers first administered a 
pre-test,4then introduced students to programmed 
learning. Records were kept by the teachers of 
frames completed during each class period and the 
time (to the nearest minute) that each student worked 
on the programmed material. Students proceeded 
to work alone through the program without help. 
After each subject completed Book I, the teacher 


administered a test on material in that book. Afinal 
test4 was administered after each student completed 
the total programmed unit. The test given at the 
end of Book I and the final test were administeredin 
six different forms, the only difference being the 
order of the items. The number of forms were se- 
lected and distributed in a manner to control vari- 
ability and ease of administration. In addition a 
delayed retention test was given one month later, 
the results of which are not included in this report, 

A small percentage ofthe subjects did not 
complete the programmed material because of ill- 
ness and transfers, and consequently could not be 
utilized in the analysis. The smallest number of 
subjects in each group was 23 and, therefore, to 
equalize the subjects in each group 
for statistical analysis, additional subjects were 
removed randomly so as to reduce the number in 
each experimental group to 23. These losses re- 
duced the initial population from 609 to 552, 

The most commonly reported procedure for 
explicating differences among treatment groups is 
the analysis of variance followed by ‘‘t’’ tests be- 
tween selected comparisons. Ап overall analysis 
of variance continues to be statistically appropriate, 
but “07 tests between selected comparisons are 
now considered inappropriate. Before outlining а 
more suitable procedure for contrasting means of 
interest, mention should be made of an analysis of 
variance on the final test scores and learning time 
(in minutes); results are presented in Tables II and 
ПІ. The effect of intelligence was significant at a 
high level, the F value being 153.41. The interac- 
tion of IQ with response mode approached the . 05 
level of significance (F = 2. 87). 

Mode of response, overt and covert, was found 
to produce significant differences for amount of to- 
tal learning time, as shown in Table II (F = 177. 037). 


The most appropriate procedure for studying 
the treatment effects and interaction effects in this 
study is Scheffé’s (9) contrasting procedure. This 
procedure is illustrated with reference to the final 
test scores, Table VI. 


The experimental model (see footnote p. 485) is: 


ПИВ БС "АВ: AC). BC 
Yijkl- B+ irt ak + ој + Qik + Сір 


ABC 
*QGjk + ijk 


where A refers to the intelligence factor with index 
i, B to the response mode factor with index j, and 
C to the level of reinforcement factor with index 
k. The index 1 refers to the replication of subjects 
where the errors, €jjk], are distributed according 
to the usual assumptions and the model itself is 2 
fixed-effects one following Scheffe. 

The means for effects and means for interac- 
tion effects are given in Tables IV through VI. 
From these mean values the main effects and inter- 
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action effects can be found and are given in Tables 
VII through IX (except the third order interac- 
tion). For example, the interaction effect for high 
intelligence by overt response mode is: 


A т x + > 
Cs Ke ИЕ ХОТ а 


The standard deviations given with eachof these 
tables can be used as the basis for evaluating, by 
inspection, the degree to which the treatment ef- 
fects and interactions affected the dependent vari- 
able mean score on the final test. АП values in 
Tables VII, ҮШ, and IX lie close to or with- 
in one standard deviation; values in Table IX lie 
within two standard deviations. This indicates that 
the corresponding treatment effects and interactions 
were not significantly different, as previously 
shown by the analysis of variance. Values in Table 
VIII lie outside three standard deviations, indicat- 
ing significant differences among the three levels of 
intelligence. This is an approximate visual, eval- 
uation procedure and if any effects appear to be 
large, they should be checked by the use of valid 
contrasting methods. 

Further analysis and information about the 
treatment effects and interactions in which the in- 
vestigator is interested may be gained Бу contrast 
procedures such as the following. These proced- 
ures are illustrated by contrasting the mean of the 
high-intelligence-overt group with the mean of the 
middle-intelligence-overt group. These means have 
the greatest difference and would probably be sig- 
nificant at the . 05 level if a “t” test was performed. 

In view of the previously stated model, the em- 
pirical estimate of the parameters in this model 
(effects) are related to the calculated means in the 
way given below. For example, the calculation of 
a suitable effect contrast in the interaction of intel- 
ligence by response mode is as follows: 


Xi. =,©үз + бу ОН 
x „АВ >. АВА 
12,2 i2» GRE ОВ 
X. -2Ав pM, 
21.. 7^ (21 * C2 * ОИНИ 
X _ a AB АВ A 
22., = @22 + &2 - Q2 * Ш 


ТАВ SAB САЗЗОВИ АТЕВ 
АВ A ол Өз mu 


2 


„АВ АВ LA В SRE BN TAA 
ал к 5 +Â, +Â, + Ro + йз + 20 


2 


„АВ „АВ „АВ „АВ 
= Qir + Q22 - 012 - Har 


2 


„АВ „АВ „АВ „АВ 
аі. - Hie - His + Ais 


2 
T ^АВ „АВ ^AB ААВ 
since сү, =-@ 2; апа 012 = -Q22 
so that Žu- + Xo. X X. tX. 
2 2 
ААВ ^АВ ХАВ ^AB 
= 2(X:;-012) - Qir- 912 
2 


If one uses E[X,,,,] etc., (the expected value), 
then one gets ©, (the population values) etc. So 
the contrast of means, 7, is № = 


4AB „АВ {2 Х.1.. + Xoo... Е X. tX. 


64-0 z 
11 12 2 2 


_ 30.598 + 26.206 _ 24, 489 + 29.130 
2 2 


. 56.8043 _ 53. 619 
2 2 


= 28. 402 - 26. 809 
= 1.593 


In this contrast, 2, of means, the coefficient of the 
observations is 1 or 1 ; and so formally 


184 “184 
Gee T UN =Z aij Yu = 2 Cp Үк =1. 64 
since ajj = + ти so 2n igi 
= (92) 12" = уз 


Therefore, since 9$ = gx aij = 82 55), 


апа ѕіпсе s? = Ме = 41. 357 , 
then %- A - 0.4495 ; 
therefore 5% = V0. 4495 = 0. 670 


Now, |0 - |= 8. [ur defines а 95% confidence 


interval on j [the expected value Т) (the contrast)]. 
S.os = (a F.os ,а,Уе)? , where q is the relevant 


interaction degrees of freedom, Ve = 528 = oo and 

% 
F(,05, 2,00) = 3:00 . Then Sj = (о) (3. pre Ne 
Now we recall that |y - 2| 5 S.9¢ of 


АВ „АВ AB AB 
but %-0641-(1і2 50 ф-011- Hie 
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^ AB AB 
Therefore, |y - № | = [осу - «2 - 1.593| = 


Sor op = (2. 45)(0. 670) 


implies that |«АВ- «АВ. 1.593 |= (2. 45)(0. 670), 


which implies that [5253 RW. 1.593 | < 1.6415, 


АВ АВ 
which is equivalent to -1.64< с), - 02 -1. 59 
= 1.64 


AB AB 
-0.05- o; - a, = + 3.23 


and implies 
which is our 95% confidence interval for the differ- 
ence between the interaction of high-intelligence- 
overt and middle-intelligence-overt, or—since the 
interaction effect has to sum to zero—it is equiva- 
lent to the differences between high-intelligence- 
covert and middle-intelligence-covert. We notice 
that, since the confidence interval crosses over 
zero, the interaction does not attain significance at 
the 5% level, which agrees with the significance of 
F - 2. 87, barely below the 5% level. 


Interpretation 


The results of our experiment indicate that the 
only factor significantly associated with immediate 
acquisition is intelligence (Table II). The intelli- 
gence and response mode interaction (Table II) ap- 
proaches significance and seems to deserve further 
investigation, since the power of this. comparison 
is not large. 

The amount of time required to learn the pro- 
grammed material is affected significantly by the 
mode of response, the covert mode requiring sig- 
nificantly less time. 

As discussed earlier, the writers feel that the 
interpretation of such results should be made on 
the basis of the power of the tests of significance, 
andtheresults of such an analysis used to judge the 
benefit and direction of further experimentation. 

First, what are the chances of detecting differ- 
ences due to the effects of intelligence? Following 
the mathematical and statistical formulations as 
given inthe section оп Experimental Design in 
Perspective, we find that 


08? = ЈУ (8 - B. 
Сауд 552[ (5. 388)? + (0.871)? + (6.259)?] 
Татар ҚАЛИДЫҢ NEC TES m 


- (552) (68. 9642) 
5 41,357 


b^ та» 


5 = 30. 3394, 


0-5(»,,1)% 


NS 


therefore 


-(30.33944) L 
УЗ” 


= 17.51 


Since the difference is highly significant we would 
expect that the power of determining such a differ- 
ence will be very large, verified by the fact that 
the power №. 999 (see tables, ref. 9, as a func- 
tion of 0). 

We are virtually certain then that intelligence 
is a significant factor and needs to be taken into ac- 
count in further experimentation on programmed 
instruction. 

For the effect of intelligence vs. response 
mode interaction, with 


1-Y -.95,0:-.05, 7-92, I - 6, and Q = 2.28, 


it is shown that A - 5.3, so that we could havede- 
tected only differences of the given power at least 
5. 3 scale points, where the empirical differences 
меге of the order 1. 7! 

With these considerations in mind, we can re- 
turn to the question posed earlier. What order of 
differences are we interested in detecting for the 
purposes of utilizing programmed instruction? 

This question cannot be answered statistically, 
for it depends on the objectives of education togeth- 
er with the limitations of time, money, and effort. 
In the analysis of our experiment we have illustra- 
ted the case where the order of differences to be 
detected is опе scale point. In this experiment we 
have been successful only in identifying conditions 
under which differential learning of a magnitude of 
five scale points or larger occurred. 

We have not considered here the characteris- 
tics of the programmed material. Overall, it is 
obvious that the high intelligence and middle intel- 
ligence groups were able to perform better than the 
low intelligence group. Within these intelligence 
levels, however, learning was not affected by the 
experimental conditions. 


Summary 


This paper outlines an approach to experimen- 
tation on programmed learning within school set- 
tings. The objective is to establish a perspective— 
philosophical and experimental—by which the con- 
tribution of programmed instruction to education 
can be evaluated reasonably and advantageously. 
This objective is structured by three important ed- 
ucational questions: What constitutes im proved 
learning? Which students can improve by this form 
of instruction? What are the conditions under which 
improvement occurs? 


EV 
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Answers to each of these questions have been 
attempted with reference to an experiment in pro- 
grammed instruction. Тһе subjects for this exper- 
iment were 552 ninth-grade pupils who worked 
without help through a programmed unit in mathe- 
matics. Teachers normally responsible for these 
ninth-grade classes in mathematics administered 
the experimental procedures throughout the two- 
week period. 

Results of this experiment indicate that the 
most significant factor associated with immediate 
acquisition of the programmed material was intelli- 
gence. The covert response mode was foundtotake 
significantly less time for completion of the pro- 
grammed material without corresponding decrease 
on the immediate retention measure. 

The meaning of these results was discussed in 
terms of the power of the tests of significance and 
the consequent implications for designing future 
experiments in the area of programmed instruction. 
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TABLE I 


MEANS AND STANDARD DEVIATIONS FOR THREE DE PENDENT 
VARIABLES OVER ALL SUBJECTS 


(М = 552) 
Меап Standard Deviation 
Book I Test 11.25 3.12 
Final Test 24.48 8. 00 
Learning Time (minutes) 260. 37 62.18 


TABLE II 
ANALYSIS OF VARIANCE OF TOTAL SCORES ON FINAL TEST 


Source (Model Parameter) Sum of Squares df Mean Square F, 
«А (IQ, 1) 12689. 047 2 6344. 523 153. 41* 
cp (Response Mode, M) 8. 625 1 8. 625 
of? (IQ х Response Mode) 237.598 2 118. 799 2.87 
Енші Residual Error 21836. 435 528 41. 357 


*Significant beyond the . 01 level. 
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TABLE III 
ANALYSIS OF VARIANCE OF LEARNING TIME (TO THE NEAREST MINUTE) 
Source (Model Parameter) Sum of Squares dt Mean Square F 
A 
œj (IQ, 1) 9140. 229 2 ‚4570, 114 
B 
о} (Response Mode, M) 515085. 225 il 515085. 225 177. 037% 
AB 
eu (IQ x Response Mode) 646. 156 2 323. 078 
Eijk] Residual Error 1536200. 000 528 2909. 470 
*Significant beyond the . 01 level. 
TABLE IV TABLE V 
MEAN SCORES ON FINAL TESTS FOR MEAN SCORES ON FINAL TEST FOR 
RESPONSE MODES IQ LEVELS 
x a 
Overt 24. 351 High 29. 864 
Covert 24, 601 Middle 25. 347 
a so 
Low 18.217 
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TABLE VI TABLE VII 
MEAN SCORES ON FINAL TEST FOR INTER- MAIN EFFECTS FOR RESPONSE 
ACTION OF IQ AND RESPONSE MODES MODES* 
(N = 276) 
High Middle Low 
Б 5 Overt -0. 125 
Overt 30. 598 24. 489 17. 967 Covert +0. 125 
Covert 29. 130 26. 206 18. 467 5 
в =.150 *Error Term = 
AU i 41.351 
(M) 
TABLE VIII PABLE IE 
MAIN EFFECTS FOR IQ LEVELS* INTERACTION EFFECTS OF IQ AND RESPONSE 
(N = 184) MODES 
(N = 92) 
High 45. 388 High Middle Low 
Middle +0. 871 Overt +0. 859 -0. 733 -0.125 
Low -6. 259 Covert -0. 859 +0. 734 +0. 125 
ва = .449 *Error Term = 41. 357 
sin =.225  *Error Term = 20м) — 670 
sD -.474 41.357 (xM) ^ 
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An Experimental Evaluation of Programmed Learning’ 


THE STUDY reported here is an investigation of 
1) the effectiveness of a learning program, and 2) 
the effect of two variables on the outcomes of pro- 
gramed learning: (a) motivational differences among 


learners and (b) the form of the learners’ responses. 


This study is an outgrowth of an earlier study by the 
author (1). 


The Experimental Programed Text 


The programmed text used inthe study was a 280 
frame (8 lessons, 35 frames) Skinnerian-type learn- 
ing program titled “Ап Introduction to Counseling’’ 
(a revision of Ellison, ref. 2). The content of the 
program included mainly Porter’s categories of 
counselor responses (8) and parts of Perry and Es- 
tes (7) article dealing with a resolution of the direc- 
tive-nondirective dilemma in education and therapy. 
The behavior expected from the learner in relation 
to this content is made clear in the description of 
the dependent variable measures. (See description 
of scores below, and Table П appended. ) 

Structurally, the program is Skinnerian, moving 
gradually from highly prompted initial frames to un- 
prompted terminal frames which define the criterion 
behavior. There was no fixed standard for frequen- 
cy of review items, speed of fading, or type of 
prompts. One criterion of the effectiveness of this 
program is the percent of correct responding within 
the program. Оп а constructed response version of 
the program, the error rate on lessons тапредігот 
seven to 14 percent, with an overall error rate of 
11 percent. Тһе error rate on criterion frames a- 
lone (within the program) was 12 percent. However, 
Several criterion frames had error rates ranging 
from 12 to 32 percent. 

. An inspection of these frames reveals that they 
involve distinctions between Porter's *tevaluative'" 
and ‘‘interpretive’’ counselor response categories. 
The current version of the program has cut error 
rates on these difficult frames by 50 percent, but it 
appears that future improvements are not likely to 
be made without a redefinition of the *jnterpretive'" 
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and ‘‘evaluative’’ counselor response categories. 
The error rate on criterion frames, excludingthese 
few difficult items, is only five percent. Thus, we 
have a fairly adequate Skinnerian type program, 
structurally speaking, and we have discovered spe- 
cific weaknesses in the content of our program. 


Design and Procedures of the Study 


The learner response variable defines the four 
treatments in this investigation. They are as fol- 
lows: 

y Treatment 1 utilized a constructed response ver- 
sion of the program as described above. T he stu- 
dent read the item, wrote his answer on a separate 
sheet, then turned the page to find the correct an- 
swer. Students were limited to one lesson a day. 
After completing a lesson, they went back over the 
items missed. Three trials were allowed on miss- 
ed items. 

Example: In the layman's terms, *counseling' 
and ‘advising’ have often been thought of as synony- 
mous, but the process of counseling is more than 
giving 4 
Treatment 2 utilized the same program as Treat- 
ment 1 except that the initial letter(s) of the re- 
sponse called for were given in red. 

Example: In the layman's terms, 'counseling' 
and ‘advising’ have often been thought of as synony- 
mous, but the process of counseling is more than 
giving ad : 

Treatment 3 also utilized the same program and 
procedures as Treatment 1, except that the entire 
response called for was included in the program in 
red, the student was simply required to copy the 
response, and checking the answer or repeating i- 
tems was not done since no errors were made. 

Treatment 4 utilized the identical program used 
in Treatment 1, but the procedures differed. In this 
treatment, each student had a proctor who said 
“right”? if the answer Was correct, and ‘‘wrong, try 
again" if the answer written by the student was in- 
correct. The student kept trying until he got it 
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right or until he had made three trials. 

A record of time on the lesson and actual respon- 
ses were kept for each student. 

Before assignment to treatments, students were 
administered the American Council on Education 
Psychological Examination (ACE) and a work sam- 
ple program followed by semantic differential judg- 
ments of the program (to be described below). One 
hundred and seventy-five out of an initial sample of 
230 juniors in secondary education were used as 
subjects. Twenty-seven more were used as proc- 
tors for Treatment 4 and the remainder were used 
in another study. Students were randomly assigned 
to treatments and then a few cases were intention- 
ally shifted to avoid wide discrepancies in grade 
point average (GPA) and total ACE scores. 

There were no significant differences among 
treatments in GPA, ACE-Linguistic, ACE-total and 
pre-experimental evaluative judgments of the treat- 
ment to which they were assigned. There were sig- 
nificant differences in the ACE-Quantitative score. 

During the program (after the second and sixth 
lesson) evaluative judgments were made of the treat- 
ments. These evaluative judgments using the se- 
mantic differential techniques constituted our mea- 
sure of *motivation", After completion of the pro- 
gram, an immediate retention test was given and 
seventeen days later it was repeated. 


Description of Scores 


Scores referred to in the results are described 
here Some of these measures are based on pre-ex- 
perimental data, some on data obtained during the 
experiment and some on immediate post-experimen- 
tal testing and delayed (17 days later) post-experi- 
mental testing. 

1. Grade Point Average (GPA)-- This score is 
a cumulative grade pointaveragetaken from univer- 
Sity records except for a few cases (transfers) 
where we used the student's estimate. Treatment 
mean GPA's were 2. 69, 2.62, 2.77, 2. 73 respec- 
tively for Treatments 1, 2, 3, and 4 where а 4. 0 is 
an “А”. 

2. American Council on Education Psychological 
Examination--Linguistic (ACE-L)--Treatment 
means on this score were 69. 98, 68.96, 71.35, 
72.67 respectively for Treatments 1, 2, 3, and 4. 
L-score of 70 corresponds to about the 60th percen- 
tile in four year colleges on the 1954 college fresh- 
men edition. 

Semantic Differential Scores-- These scores 
were obtained by use of Osgood's (6) semantic dif- 
ferential technique. Using ten of his seven-point 
bi-polar adjective, evaluative scales (good-bad, 
valuable-worthless, fair-unfair, etc.), we had stu- 
dents judge all treatments before assignment to a 
treatment. Each student was taken through part of 
an earlier version of the Analysis of Behavior (3) 

doing two lessons (sets) in each of our four treat- 
ments. They then judged all four treatments on the 


semantic differential evaluative scales. Judgments 
of the treatment to which they were assigned were 
taken after the second and sixth lessons of the ex- 
perimental program. Scores 3, 4, and 5 below 
were based on this data. A high score is “боой” 
and a low score is ‘‘bad’’. 

3. Pre-treatment Evaluation of Treatment (EvT 
-pre)-- This score is a subject’s pre-experimental 
judgment on the ten evaluative scales of the method 
to which he was finally assigned. 

4, Evaluation of Treatment after second lesson 
(EvT-2)-- This score is a subjects evaluative judg- 
ment, after lesson two, on the ten evaluative scales, 
of the concept ‘‘treatment to which you were assign- 
ей”, 

5. Evaluation of Treatment after sixth lesson 
(EvT-6)-- This score is a judgment by the subject, 
after lesson six, on the ten evaluative scales, of 
the concept **treatment to which you were assigned? 

6. Total Error (TE) -- This score is simply the 
total number of within-program errors, where an 
error is either an omission of response or incor- 
rect response on the first try. There were, of 
course, no errors in Treatment 3. With 436 blanks 
in the 280 frame program, the total possible error 
range was 0 to 436. 

7. Total Time (ТТ)-- Time (in minutes) for 
completion of the program determined this score. 
A full 50-minute class period was available and on- 
ly one lesson per day was allowed. 

8. Total Immediate Retention (T-I)-- This score 
was based on the test given immediately upon com- 
pletion of the program. Actually, it was given dur- 
ing the two days following completion of the pro- 
gram. It is based on several subtests without suf- 
ficient reliability or independence for separate use. 
Reliabilities for this and the following score were 
in the 80's (see Table I). 

Sample items from subtests are presented in 
Table П. It is apparent that the test called for а 
wide range of behaviors in relation to the program 
content. One test (Composing Specified Counselor 
Responses) called for behavior not actually called 
for within the program. 

9. Total Delayed Retention (T-D)-- This score 
was obtained exactly as for variable 8 except that 


the test was given seventeen days later. 


Summary of Results 


1. Effectiveness of tne programmed text-- We 
mentioned above that there was a 12% error rate ой 
criterion frames within the program. This error 
rate dropped to 10% on the criterion frames test 
given upon completion of the program. The out- 
standing findings of an item analysis of post-treat- 
ment measures are: 

a. Achievement was rather good on all tests. 
Tnus, the median item p (percent passing) on sub- 
tests were : Cf-test (93. 5), Rg-test (89. 5), An-test 
(63.5), and Syn-test (73. 0). 
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b. The program was not as effective in get- 
ting subjects to label ‘‘new’’ counselor responses 
(Analysis Test) and to produce specified counselor 
responses (Synthesis Test) as it was in getting re- 
sponses actually called for in the program. 

c. The tests pointed up the kinds of behavior 
that need more effective programing (e.g. , the in- 
terpretive-evaluative distinction). It is clear that 
many students were able to produce written coun- 
selor responses even though they were not called on 
to do that in the program, but more effective pro- 
duction of this behavior might follow actually call- 
ing for such responses in the program. 

It appears that great variability on dependent 
variable measures is synonymous with poor pro- 
graming. This suggests some possibilities for re- 
search strategy in the case where we are still look- 
ing for factors influencing performance. 

a. Where a highly effective program has 
been developed, an analysis of individual cases 
might be made to generate hypotheses as to why 
some few subjects did not perform adequately or a 
greater number of the special cases with greater 
variability may be sought for investigation. 

b. While we no doubt have at hand the tech- 
niques for effective programing for the kind of 
learning outcomes we have been measuring, it is 
likely that other significant programing variables 
will arise as we attempt to construct longer pro- 
grams for a variety of learning outcomes over уаг- 
ying retention periods. 

2. The learner response variable--Conclusions 
drawn from data relevant to this variable are dis- 
cussed under a and b below. 

a. As indicated in Table III, we did not find 
significant differences among treatments on either 
the immediate or delayed total retention measures. 
Since we have already demonstrated the low error 
rate in our program (and in post-treatment mea- 
sures) our finding is consistent with previous stud- 
ies (5) indicating that variations inlearner response 
mode have little relevance when there isa high 
probability of correct responding. We had expected 
different results in the present study because we in- 
cluded some tests calling for behavior like that 
called for in the program and some not. However, 
as mentioned above, subtests Were generally too 
easy and thus lacked sufficient reliability for an 
adequate look at this possibility. Research is now 
under way extending two of the subtests with already 
fair reliabilities and low intercorrelations. 

b. Since our time and error variables yield- 
ed significant differences among treatments, it was 
expected that an analysis of covariance, taking time 
and errors into account, would yield significant dif- 
ferences among treatments on Total-Immediate 
and Total-Delayed retention measures. А sum- 
mary of results derived from such data is present- 
ed under 1 to 5 below: 

1) Errors within the program were sig- 
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nificantly negatively correlated with T-I and T-D 
retention measures in Treatment 1, 2and 4 (see 
Table IV). 

2) Mean Total Errors in Treatment 2 
were significantly lower than errors in Treatments 
1and4. Mean errors for Treatments 1, 2 and 4 
were 47, 21 and 47, with sigmas of 27, 11 and 19. 

3) Time on program was negatively cor- 
related with T-I and T-D retention measures on 
Treatments 1 and 4, but not in Treatments 2 and 3 
(see Table IV). 

4) Mean Total Time on program was sig- 
nificantly greater in Treatments 1 and 4 than in 
Treatments 2 and 3. Mean time (in minutes) for 
Treatment 1 to 4 respectively was 190, 150, 149, 
252, with standard deviations of 49, 37, 39 and 49. 

5) Applying Kramer's extension of Dun- 
can's Multiple Range Test (4) on the results of the 
covariance analysis (see Table V), we find that 
Treatments 2 and 3 were significantly more effi- 
cient than 1 and 4 in terms of the criterion of a~ 
chievement per unit of time and errors. These re- 
sults are consistent with those obtained in other 
studies using a Skinnerian type program. 

3, The motivational variable-- Conclusions rel- 
evant to our motivational variables (variables 3, 4 
and 5, described above) are summarized here. 

a. As indicated in Table VI, for Treatment 
1, subjects’ **good-bad"" judgments of the treatment 
were significantly correlated with performance 
when these judgments were made before the exper- 
iment or after the second lesson, but not when 
made after the sixth lesson. The lack of a signifi- 
cant correlation between evaluation of the treatment 
(after Lesson 6) and T-I and T-D retention may be 
explained largely by the fact that the mean evalua- 
tion of the treatment went up and the variance be- 
came quite low. Osgood (6) found that a shift in 
factor score of .4 score units is significant at the 
105 level. If this is so for our data (we have no 
stability measures), the shift from Ev-T-pre to 
EvT-6 is significant. 

b. Also in Table IV we find that in Treat - 
ment 3 (the copying treatment), the only significant 
correlations between motivation and achievement 
were the negative correlations between evaluation 
of treatments after Lesson 6 and T-I and T-D re- 
tention. It is tempting to speculate concerning 
these results. It appears that one’s judgment of 
method 3 changes with experience in the program 
(e.g. 5 correlation pre-experimental evaluation and 
Lesson 2 evaluation is .46, but between Lesson 2 
and Lesson 6 it is . 13). This raises the question 
as to what happens to attitudes after experience 
with а method. It appears here that they change, 
put are fairly stable after the second lesson. The 
negative correlations between these judgments and 
test performance could be due to interfering emo- 
tional responses when those who rated the treat- 
ment favorably found that the answers did not come 
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immediately to them on the post-experimental mea- 
sure, 

c. Analysis of covariance, taking semantic 
differential judgments of treatments into account, 
Still did not yield significant treatment differences 
on our dependent variable measures. 

In summary, our data show that we have a fairly 
effective constructed response program in terms of 
high correct responding on criterion frames and 
other post-experimental measures. For this pro- 
gram, the variations in learner response mode 
which we introduced produced no differences in a- 
mount of immediate or delayed retention. However, 
the more highly prompted Treatments (2 and 3) 
yielded an advantage of efficiency over the other 
treatments. That is, they produced the same im- 
mediate retention with less time and fewer errors 
in learning and the same delayed retention with few- 
er errors inlearning. Also, adjusting for differ- 
ences in our motivational variables yielded no treat- 
ment differences on immediate and delayed reten- 
tion. However, in the constructed response form 
(Treatment 1) the subjects’ evaluations of the treat- 
ment were positively related to performance and 
showed an increasing preference with experience in 
the treatment. The significant finding in relation to 
motivation is that it does make a difference on with- 
in-group variance for the constructed response pro- 
gram and that this suggests further research on 
long-term effects and on how motivation operatesto 
influence performance as well as suggesting some 
immediate practical procedures and advantages for 
taking preferences into account in assigning pro- 
grammed texts to students. À 
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ESTIMATES OF KUDER-RICHARDSON FORMULA NUMBER 20 RELIABILITIES AND TOTAL 
IMMEDIATE (Т-1) AND TOTAL DELAYED (Т-р) RETENTION MEASURES 
IN EACH TREATMENT ON A 128-ITEM TEST 


— M— M M—— — M M € 


Variable Mean S.D. Range Reliability* Median 
Treatment 1 (М - 50) 
тұз? 96.36 13.37 69-118 ‚865 96.17 


66-117 


Treatment 4 (N - 27) 


T-I 98.23 13.19 
T-D 96.88 12.06 
T-I 100. 96 10.48 
T-D 98. 00 11.90 


81-121 
70-117 


* 


Procedure for estimating reliability is Saupe's formula 22 (9). 
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SAMPLE ITEMS FROM SUBTESTS IN POST-EXPERIMENTAL MEASURES 


L-8 (34) 


L-8 (24) 


as 
72. 


Criterion Frames Test (Cf) 


The four types of errors commonly made in attempting an understanding response are: 


1. 2. 
3. 4. 
Out-and-out interpretation, particularly if too premature, may result in an error of 


Example: 


Client: “I can't remember his name. ” 


Recognition Test of Terminology and Theory (Rg) 


«Heuristic set’? has been reached when: (a) the counselor reflects the client’s feelings 
and perceptions; (b) the client accepts the counselor’s advice; (c) the client looks at 


the counselor as a source of information to be considered; (d) the client’s negativism, 


Test of Ability to Analyze Client-Counselor Interchanges (An) 


Choose the counselor response imappropriate in Depth. (A depth error in making an 


understanding response. ) 


Client: *I^m amazed at the way I can fool myself. I thought something was wrong. 


The good me...and the bad me...I was worried. Thought I might be a schizoid 


personality. 1 wasn't worried seriously. ” (а) **You have been somewhat worried 


about whether you might be a schizoid personality. ”” (b) “Іп other words, some 


serious questions came up, but they didn't worry you too much. " (с) *You have 


kept fooling yourself so much it's driving you crazy. » (d) ‘There is really nothing 


to be worried about. ” 


Test of Ability to Synthesize or Compose Specified Counselor Responses (Syn) 


Make an Interpretive counselor response. 


Client: “I have a decision to make, but there are personal problems behind it that 


a doctor or psychiatrist should probably deal with. ’” 


Make an Understanding response with a Content error. 


Make an Understanding response with a Language error. 
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TABLE III 


ANALYSIS OF VARIANCE OF TOTAL IMMEDIATE AND TOTAL DELAYED 
RETENTION MEASURES ON FOUR EXPERIMENTAL TREATMENTS 


Source Sums of Squares df Mean Squares Е* 


Total Immediate (T-I) 
Among 785 3 262 1.5 
Within 29636 171 173 


Total Delayed (T -D) 


Among 360 3 120 may 
Within 32154 171 188 
Total 32514 


* For 3 and 171 degrees of freedom an F of 2. 66 is required for significance at the . 05 level. 


TABLE IV 


CORRELATIONS BETWEEN UNCONTROLLED VARIABLES AND TOTAL IMMEDIATE 
(T-I) AND TOTAL DELAYED (T-D) RETENTION MEASURE 


==————————————————Є—Є—ЄїЄ—ЄЄ—Є—ЄЄ—ЄїЄ—Є—Є&Єү——Є—ЄїүїЄк&Є&][ 
Variable Mean S.D. nsi T-D 


Treatment 1 (N - 50) 


GPA 2.69 53 ‚19 .22 
ACE-L 69.98 14. 66 .55 .51 
ТЕ (Error) 41.48 27.44 -.54 -.50 


TT (Time) 


GPA 

ACE-L 

TE (Error) 20.50 10.99 -.33 -.36 
TT (Time) 149.50 36.70 03 -.19 


Treatment 3 (N - 48) 


GPA 2.77 .60 .48 .42 
ACE-L 71.35 13.28 -37 .40 
TE (Error) 7 -4---5 20-2 55-2 ---- Se 
TT (Time) 148. 90 


GPA 2.73 .62 .51 .54 
ACE-L 72.67 16.61 .55 .56 
TE (Error) 47.04 19.33 -.69 -.64 
TT (Time) 231.70 48.70 -.43 -.38 


Values of г for significance at the . 05 level (in parentheses) and . 01 level (underlined 
and in parentheses) are as follows: N - 50 (. 24) (. 33) ;N = 48 (.24) (.34); М = 27 
(. 32) (. 45); these are one-tailed tests. 
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THE JOURNAL OF EDUCATIONAL RESEARCH 
(Volume 55, Number 9, June-July 1962) 


An Investigation of Presentation, Response, and Correction 


Factors in Programmed Instruction" 


THE PURPOSE of this experiment was to test 
the comparative effectiveness of five programming 
arrangements. А standard programming format 
hàd been developed that theoretically should maxi- 
mize learning efficiency." At each step in this 
program an item of information and an example 
relevant to the information was presented, a ques- 
tion on the item asked, correction of answers pro- 
vided, and an explanation of the correctness of an- 
swers and incorrectness of alternatives given. In- 
formation content is easily mastered at each step 
in this program and performance is continually 
monitored. 

The second program format omitted the expla- 
nation for the correctness of answers at eachstep. 
This program was used to determine if explanation 
with correction increased the effectiveness of the 
reinforcement procedure. The third program pre- 
sented the information and example but no question 
was asked. This condition tested the relative ef- 
fectiveness of overt practice and immediate rein- 
forcement. Тһе fourth program presented infor- 
mation, question, and correction but omitted the 
example. A comparison of performances in this 
condition and the first condition tested the effec- 
tiveness of adding an example at each step of the 
program. Тһе fifth program presented only an 
item of information on each page of the program. 
This condition is theoretically the weakest condi- 
tion; however, previous experimentation had indi- 
cated that students learned as well in this condition 
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as under the standard programming condition (1,2,4). 
Conditions three and five were included to replicate 
previous results. Table I summarizes the experi- 
mental arrangements. 

The standard program used in condition one in- 
corporates the elements regarded as essential in 
programming techniques., Each item of informa- 
tion is easily mastered, a response is required at : 
each step of the program, and immediate correc- | 
tion is provided. Тһе addition of an example at | 
each step and a review of the contentin the correc- si 
tion procedure should enhance learning. The addi- 
tion of the example is a form of the “гшер”” pro- 
gramming procedure (3). Conditions two and four 
are also common programming arrangements. А ү 
comparison of these conditions with condition one 4 
permits a test of the effectiveness of reviewing in- 
formation in the correction procedure and of add- 
ing an example at each program step. 

The third and fifth conditions are ‘‘weak’’ con- 
ditions. Condition three adds explanation in the 
example and probably should be more effective 
than condition five. 

Although our prior research results (see above). 
suggest that these program variations are not 
likely to yield substantial differences, it was de- 
cided to use commonly held conceptions in making 
the experimental predictions. Consequently, it was 
predicted that (1) conditions one, two, and four. 
Should prove superior to conditions three and five 
in their effects on learning; and (2) condition three 
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should prove superior to condition five. It is not 
possible to predict which of the conditions, two or 
four, will be superior; however, the design per- 
mits a test of relative effectiveness. 


Methodology 


The programs taught an original two-person 
board game roughly similar to games like chess 
and checkers. A twenty-seven item program de- 
scribing the rules and elementary strategy of this 
game was developed and tested on several hundred 
high school students. Each item in the program 
was of minimum difficulty. Ninety percent of 
samples of senior high school students answered 
correctly questions on each item and the total 
error rate for all items in the program was ten 
percent. This content was used because it was 
unfamiliar to the learner and because some general 
interest in the content might be expected. Experi- 
ence with the program indicated that the range of 
interest in its content was similar to the range of 
interest in a typical school subject in an average 
, classroom, 

The programs were presented in booklet form 
with an item of information, an example, and a 
question on each page. The correction with expla- 
nation appeared on the following page. All pro- 
grams consisted of twenty-seven items, and iden- 
tical information was presented at each step. 
Variation in this format was achieved by omitting 
one or more of these elements. Students wrote 
their answers to each question on a separate an- 
swer sheet and were instructed not to turn the | 
| page before answering the question. Supervision 
was maintained to insure that this procedure was 
followed. ( 

The programs were administered in students’ 
classrooms. Subjects were randomly assigned to 
. conditions. The shortest form of the program re- 
quired approximately ten minutes to complete, the 
longest form from thirty to forty minutes. Stu- 
dents were informed that they were participating 
in an experiment to study ways of making teaching 
and learning more effective and more interesting. 

On completion of the program each student was 
administered a test on the content of the program. 
This test comprised twenty-seven items. Each 
item corresponded to an item of information in the 
program. These items had been selected from a 
larger pool of fifty-three items. The items in 
this pool ranged in difficulty from forty to sixty 
per cent and each item correlated highly with total 
Score (гр = . 90 and above). The split-half relia- 
bility of this fifty-three item test was . 85 (correct- 
ed). The shorter form used in this experiment 
produced a split-half reliability of . 62 (corrected). 
The criterion test was also administered three 
weeks after the initial testing. 

One hundred and twenty-two high school stu- 


dents, juniors and seniors, were the subjects in 
this experiment. All students were enrolled in the 
same high school and were largely of middle and 
upper-middle socio-economic status. The general 
level of scholastic aptitude was above average. 


Results 


A one-way analysis of variance across condi- 
tions was performed on initial criterion scores. 
The results of this analysis are presented in Table 
Il. The differences across conditions are not sig- 
nificant. A similar analysis was performed on 
final criterion scores and the results are reported 
in Table Ш. Again, the differences are not sig- 
nificant. 

The means and variances for each condition for 
initial and final criterion scores are presented in 
Table IV. These data illustrate how small the ab- 
solute differences in mean scores are. The differ- 
ences among the variances, which are somewhat 
larger, were tested using Hartley's test and were 
found to be non-significant. 5 

Conditions 1, 2, and 4 are strong conditions, 


_ each providing for response evocation and imme- 


diate correction. These conditions did not yield 
significantly different performance scores on the 
criterion measures. Errors made on the program 
itself were examined to determine if the program 
variations affected intra-program performance. A 
one-way analysis of variance using program error 
Scores for these three conditions was performed. 
The results of this analysis are presented in Table 
V. The differences across conditions are not sig- 
nificant. 

The verbal scores of the Differential Aptitude 
Tests were available for these students. Sincethe 
distribution of scores was markedly skewed, a 
Chi-square analysis was performed to test the as- 
sociation between aptitude scores and final crite- 
rion Scores. A two-way contingency table was 
used dividing aptitude scores at the fiftieth percen- 
tile. The Chi-square value was 4. 79 for one de- 
gree of freedom which is significant at less than 
the .05 level. This Chi-square value yields a con- 
tingency coefficient of .26. These data suggest a 
moderate relation between these variables. 

Two other analyses were performed to explore 
for possible relations. Subjects were separated 
into two groups, a high verbal aptitude group (PR- 
90 and above), and a lower aptitude group (PR- 
89 and below). One-way analyses of variance for 
each group across program conditions was per- 
formed. Neither F value was significant (Е = 
1.27 and 1. 24 respectively). 

Subjects were separated by grade levels and 
combined across conditions to form two groups. 
Three one-way analyses of variance were per- 
formed on error scores for initial and final post- 
tests and for intra-program performances. Dif- 
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ferences between grade levels were significant for 
all measures. The results of these analyses аге 
presented in Table VI. These data should be inter- 
preted cautiously. Neither group at each grade 
level represents a random sample of subjects. An 
inspection of the verbal scores for both groups 
suggests that the senior group (two classes) is su- 
perior. 


Conclusions and Discussion 


The data yielded by this experiment indicate 
that the program variations did not differ signifi- 
cantly in their effects on learning. The lack of 
difference between mean scores in the *'strong" 
and ‘‘weak’’ conditions is consistent with previous 
experimentation. 1,4 In three experiments, in- 
cluding this one, the strongest and weakest condi- 
tion have yielded comparable results. 

It is important, however, to remember that the 
program used in these experiments is a short pro- 
gram, This fact may have two implications. First, 
that within some as yet undefined length limit pro- 
grammed content does not yield superior learning. 
This lack of difference may be due to the ability 
of the learner to comprehend and remember these 
**short'" units. It may also be due to the selective 
arrangement of the non-programmed material. 
Condition five, for example, presents the content 
in its **cleanest'' form unencumbered with detail, 
discursive explanation, allusions, extended illus- 
trations, and extensive applications. Тһе signifi- 
cant ideas and facts are clearly delineated and it 
has been empirically demonstrated that this con- 
tentis easily understood. It also seems probable 
that the effects of these two factors, length and 
arrangement, would be related to the capacities 
of the learner. 

The second implication is that the effects of 
programmed instruction will be demonstrated for 
units of content which exceed by progressively 
larger amounts this as yet undefined upper limit 
for *short" units. Should a relationship of this 
general character be demonstrated, one of the 
technical problems of programming would be clear. 
The problem would then be to develop systems of 
programs of varying lengths and designed for spe- 
cific purposes. This approach would probably also 
focus attention on other aspects of efficiency than 
the rate at which a learner moves through a body 
of content. 

Another possibility is that the content used in 
this program is ‘‘simple, ’’ and that the effects of 
programmed instruction are more readily demon- 
strable for complex" material. This suggestion 
is at best heuristic since adequate definitions of 
**simple'' and “complex” have yet to be formu- 
lated. The content in the programs used in this 
experiment resembles units of content in many 

school subjects. Junior high school students, for 


example, found the program more difficult than 
senior high school students.“ However, a taxon- 
omic classification of subject content is required 
before relations between type of content and pro- 
gramming format can be predicated. 

The moderate relation between verbal ability 
and the differences between grade level perform- 
ances (although uncontrolled for differences in 
ability) found in this experiment suggest that in- 
teractions between programming format and abil- 
ity may be found. Unfortunately, data were not 
available in the required form to test for these in- 
teractions in this experiment, a problem further 
complicated by the skewed distribution of ability 
in the subjects used. 

There is a suggestion in the data in this experi- 
ment that one of the effects of programmed instruc- 
tion may be to reduce the range of individual dif- 
ferences. Although the differences in variances 
across program conditions were not significant, 
condition five yielded the largest variance on both 
criterion measures. If the above speculation on 
length of program proves valid, it is not unlikely 
that these differences in variation would increase _ 
significantly. 

It is obvious that this experiment focuses atten- | 
tion on the question, under what conditions does 
programmed instruction yield a marked improve- 
ment іп learning? Тһе data presented here sug- 
gest that programmed instruction, like other in- 
structional methods, has bounds outside of which 
its relative effectiveness is not demonstrable. 


Summary 


One hundred and twenty-two high school students 
were randomly assigned to five program conditions. 
Condition one presented information and an ex- 
ample and evoked a response from the learner at 
each step in the program. Тһе response was cor- , 
rected immediately and an explanation for the cor- 
rectness was provided as part of the reinforcement 
procedure. Condition two did not provide the ex- 
planation as part of the correction procedure. Con- 
dition four omitted the example. The third condi- 
tion presented only the information and example, 
and condition five only the information. АП pro- 
grams presented identical information at each step. 

А twenty-seven item test measuring knowledge 
of the content was administered to each subject im- 
mediately on his completion of the program. The 
same test was administered three weeks later. 

Analyses of variance were performed on scores 
on the two criterion measures and on the program 
error scores for the three conditions (1, 2, and 4) 
for which these scores were available. No signifi- 
cant differences across conditions were found 
either for mean scores or variances. 

An analysis of the relation between verbal apti- 
tude scores and performance scores indicated a 
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moderate relation between these variables. Grade 
level differences (juniors and seniors) across all 
program conditions were found, though this rela- 
tionship should be regarded as suggestive at best 
since it was not possible to control for differences 
in ability in this analysis. 


It is suggested that these results may be a func- 
tion of the length of program used. 1f this sugges- 
tion has validity the implication of these results 
may be that the effects of programmed instruction 
are demonstrable for **longer"' units of content. 
The problem of research on programmed instruc- 
tion is, in part then, to determine the conditions 
under which programmed instruction is relatively 
more effective than instructional alternatives. 
Three categories of potentially relevant factors 
may be the length of the program, the complexity 
of its content, and the ability level of the learner 
using a particular programming format. 
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PROGRAMMING ARRANGEMENTS FOR EACH EXPERIMENTAL CONDITION 


ЕЕЕ аа 


Experimental Conditions 


Program Elements i ў д А 3 
a ЕЛ” 
Information x x x x x 
Example x x x 
Question x x x 
Correction x Ж z 
x 


х 


TABLE II 


LYSIS OF VARIANCE FOR EXPERIMENTAL CONDITIONS ON CRITERION TEST, 
ue INITIAL POSTTEST (N- 122) 


Sum Squares df E 


Between 23. 88 4 


117 .403 (n.s.) 


Within 1732.16 
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TABLE III 


ANALYSIS OF VARIANCE FOR EXPERIMENTAL CONDITIONS ON CRITERION TEST, 
FINAL POSTTEST (N = 100)* 


_————————ш—+———————ЄЄ——————— 
__—————Є—Є—_——— 


Sum Squares df F 
Between 47.9 4 
Within 1017.1 95 1.1 (n. s.) 


* Absences and schedule changes account for the decrease in N. 


TABLE IV 


MEANS AND VARIANCES (ERROR SCORES) FOR INITIAL AND FINAL POSTTESTS 


Initial Posttest 2 Final Posttest 


Condition M s? M 5 Mean Change 
1 9.2 14.6 10.3 8.3 1.1 
(N = 24) (М = 19) 
2 10.0 16.5 11.3 7.8 1.3 
(N = 19) (N = 16) 
3 8.7 11.3 12.0 9.6 2.7 
(N = 25) (м = 25) 
4 8.7 12.4 10.4 14.8 
s ints 
(N = 27) (N = 25) 
5 9.3 19.3 10.5 11.8 
ч 1.2 
(N = 27) (N =15) 
Range diff. 1.3 8.0 iat 7.0 


Se ee re en cm 


McDONALD - ALLEN 507 


TABLE V 


ANALYSIS OF VARIANCE USING PROGRAM ERROR SCORES FOR CONDITIONS 1, 2, and 4. (N = 72)* 


LL——————— E E ÁO 


Sum Squares df F 
Between 13. 00 2 
Within 573. 45 69 . 182 (n.s.) 


| 
+ Two students completed the program who did not complete the criterion test. 


TABLE VI 


TESTS OF SIGNIFICANCE OF DIFFERENCES BETWEEN GRADE LEVEL ERROR SCORES FOR INITIAL 
AND FINAL POSTTESTS AND PROGRAM PERFORMANCES 


ERE нт 


Initial Posttest Final Posttest Program* 2 
M s2 M s2 M s 
Juniors 10.35 13.22 12.26 7.36 3. 68 9.42 
(N = 70) (N = 54) (М = 41) 
Seniors 1.54 11.90 9.51 9.12 1.58 4.46 
(N = 52) (N = 45) (N = 31) 
t 4.32 4.56 3.50 
df 120 Ti 10 
< .01 « .05 


p « .01 
* Program scores are available only for ree conditions, 1, 4, and 4. 
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An Exploratory Evaluation of a Discriminative Transfer 
Learning Program Using Literal Prompts?” 


THIS ARTICLE describes a short learning pro- 
gram which exemplifies what seems to be a general 
program type. The program is intended as an il- 
lustration of one approachto programming when the 
behavior to be learned involves discrimination 
training, A learning program or a part of a pro- 
gram is usually designed to carry out specific be- 
havioral processes, e.g., acquisition ofa response 
or discrimination of stimuli. Most available pro- 
grams consist of a rather difficult to describe com- 
bination of such basic behavioral processes. For 
the purpose of research in programming, it seems 
fruitful to isolate the separate functions of pro- 
grammed sequences so that they can be subjected 
to individual study and so that the several functions 
can be most efficiently combined, 

Investigations of this kind, i.e. › the study of 
programming techniques relevant to particular 
learning processes, are required for the transla- 
tion of behavioral principles into instructional pro- 
cedures. Program writing will be improved to the 
extent that the program writer can use a working 
knowledge of the behavioral principles involved. 
This paper is an attempt to show how one such basic 
principle, discrimination training, was applied in 
а rather “риге” form to the construction of a pro- 
gram. 

The application of the discriminative training 
paradigm to human learning programs rests on 
simple assumptions and permits a clear specifica- 
tion of the procedure to be followed. In bringing a- 
bout discriminative control, it is simply required 
that the desired response be elicited in the presence 
of the stimulus which is to acquire discriminative 

properties, and that reinforcement be Contingent 
upon such an occurrence. It must be assumed at 
the beginning of discrimination training that the re- 
sponse over which control is sought exists at some 
strength in the repertoire of the organism. If the 
response is weak or improbable, it can be elicited 
in the presence of the discriminative stimulus by 
presenting, in addition, some stimulus which pre- 


—— ME 
* Footnotes will be found at the end of the article. 


JULIAN |. TABER? and ROBERT GLASER 
University of Pittsburgh 


viously was brought to control the response. Such 
auxiliary stimuli which are initially used to prompt 
the response may, as training proceeds, be gradu- 
ally dropped. In this way, the control of the re- 
sponse is transferred to the new stimulus. With 
the human organism, it is possibleto carry on sev- 
eral such training procedures simultaneously. 

The auxiliary stimuli which initially produce the 
response оп the appropriate occasion may have а 
variety of forms. With human verbal behavior, it 
is possible to substitute one stimulus for another 
in most cases where both stimuli evoke the same 
form of behavior. For example, a picture ofa 
house and the word **house"' can both be used to e- 
voke the same verbal response. To bring the vocal 
response under the control of the written word, the 
discrimination training paradigm suggests that the 
picture be presented together withthe word andthat 
the picture be gradually faded or dropped from the 
situation. This transfer of stimuluscontrol in pro- 
grammed learning is in line with suggestions made 
by Skinner (1959) who proposed a situation in which 
verbal responses could be attached to non-verbal 
stimuli as in the learning of geography or anatomy. 

Although Skinner (1957) has treated the problem 
of prompting in verbal learning, the actual tech- 
nology of prompting is only beginning to receive 
experimental attention (e. g. , Angell and Lumsdaine, 
1960). It is possible to state several simple pro- 
cedures by which the prompts may be dropped from 
a learning sequence. One method of removing the 
auxiliary stimulus has been called vanishing. The 
term is best taken to mean that the prompt is liter- 
ally vanished, part by part. Fading is another 
technique of prompt removal and again, a literal 
interpretation seems to serve best. In fading, the 
prompting stimulus is actually faded in intensity, 
saturation, or some other physical quality while all 
other parts of the prompt remain the same. Athird, 
though infrequently employed technique of prompt 
removal is prompt distortion. Using prompt dis- 
tortion, for example, it would be possible to elicit 


! 


TABER - GLASER 509 


the word “сате!” with a picture of a camel and 
gradually distort the picture into the written letters 
that spell the word. As research on programmed 
learning proceeds, these and other methods of 
prompting will be further developed if they are 
found to facilitate learning. 

The program described in this report was de- 
signed to examine whether a simple application of 
the discriminative transfer training par adigm, 
making use of literal prompts, would provide an ef- 
fective instrument for use in the area of verbal 
learning. The programming technique developed 
involves; a) the transfer of controlfrom a previous- 
ly learned and effective stimulus to a new stimulus 
(discriminative transfer training), and b) the actual 
vanishing of portions of the previously controlling 
stimulus. This combination is called «діѕсгітіпа- 
tive transfer programming using literal-pro m pt 
withdrawal. ”’ 


The Program 


The program described here was called ** Learn- 
ing Color Names." It was designed toteac h the 
reading of color words to beginning first grade stu- 
dents. The program capitalized on two simple 
facts. First, that a color may be represented by 
the color itself or by the word-name for the color, 
both having the same verbalization as an appro- 
priate response. Second, children learn correct 
verbalizations to color before they learn correct 
verbalizations to color names. Observing the dic- 
tum that a program should use existing behavior as 
a basis for building new modes of behavior, the task 
of this program was to transfer or extend a verbal 
response from one stimulus (a wave length of light) 
to another (a word). 


The ‘Learning Color Names" reading program 
was designed to teach the names of eight common 
colors: brown, black, purple, blue, orange, green, 
yellow, and red. The behavioral prerequisite to 
successful use of the program was the ability to 
name the colors correctly when à colored object 
was shown. This program taught “геадіпе”” in the 
sense that the child learned to respond vocally to a 
specific set of letters in а fashion commonto adults. 


By pairing the actual color with the color name, 
and gradually vanishing the color prompt, it was 
hypothesized that the verbal response would trans- 
fer from the color to the name of the color. Since 
fading a color prompt involves printing difficulties, 


vanishing was chosen as the most feasible technique. 


It remains a question, however, whether fading or 
vanishing is the superior method. Figure 1 shows 
how the color prompt for the word “‘blue’’ was van- 
ished throughout the programmed sequence, —— 

In each cycle of the program, each of the eight 
words appeared once. In the early cycles of the 
program, only one word appearedon eachpage, and 


four appropriately colored lines radiated out from 
the printing of the word. In each successive cycle, 
the prompting color lines decreased insize until 
they appeared only as dots andfinally were omitted 
altogether. A large manuscript style of printing 
was used, and all words were printed in black ex- 
cept as noted in Table I. The prompting of each 
cycle is described in Table I. Eachírame inthe 
program was on a separate sheet of five by eight- 
inch paper, and consisted of only one color word 
with or without the appropriate prompting lines. 
Each word appeared only once in а cycle containing 
all eight words, and there were eleven cycles in 
the entire program. Thus, in going through the 
program, a child read each of the eight wordsa to- 
talof eleven times. The order in which the words 
appeared was varied from cycle to cycle. The en- 
tire program consisted of 74 frames. 

The program described in Table I was consider- 
ed a transitional experimental form. Several var- 
iations of the program led to the for m at reported 
here. In addition, several variations ha ve been 
made as a result of experience with it, For further 
work, the omission of cycle eight is suggested be- 
cause this cycle was found to lead occasionally to 
confusion. It also seems desirable to break the 
program up into two parts so that learning could be 
done in two sessions rather than in one. With two 
parts, four words could be learned completely in 
the first session, and the remaining four words in 
the second session. Although learning all eight 
words with the color names program usually took 
only a few minutes, the children were intensely ac- 
tive and seemed to tire quickly. Allversions of the 
program had a cover with the title ‘‘Learning Color 
Names.” The letters of this title were printed in 
the eight colors of the program and were used as a 
pre-test to test the ability of the child to identify 
the colors. 

The program was administered by an adult who 
sat with the child and generally turned the pages, 
attempting to adjust the rate of frame presentation 
to the child’s readiness to respond. The first 
frame of the program was shownto the child and he 
was asked simply to ‘read the мога” or ‘read 
what this word says." Further directions were 
usually necessary, however, to draw attention to 
the colored lines, but after the first frame, no fur- 
ther urging seemed to be needed. From the second 
frame on, the child went through the program call- 
ing out the name of the color represented by the 
word and the associated color prompts. Libe ral 
use was made of such reinforcers as “right” and 
*tgood" and repeating the color wor d after the 


child. 
Program Development and Tryout 
The first version of the program used all type- 


writer capitals in printing the letters, and a single 
vertical colored line drawn throughthe center of the 
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CYCLE 10 


red black 


green orange 
purple blue 
brown yellow 


FIGURE 1. 


from the Learning Color Names program showing prompt 
austin for the word ‘‘blue” at different points in the program. 
All half-tones in this figure were printed in blue in the orig! 


program. 


TABLE I 


PROMPT FADING USED IN THE **READING COLOR NAMES" PROGRAM 


Cycle 
Cycle 
Cycle 
Cycle 
Cycle 
Cycle 
Cycle 


Cycle 


Cycle 


Cycle 10: 


Cycle 11: 


9: 


Colored radiating lines 1 inch long. 


Colored radiating lines 1/2 inch long. 


: Colored radiating lines 1/4 inch long. 

: Colored radiating lines 1/8 inch long. 

: Four colored dots, replacing the lines at the points where the lines originated. 
: Two colored dots, placed at the beginning and end of the word. 


: Nolines or dots. Each word printed in its appropriate color. 


Two letters of each word printed in the appropriate color, the rest of the letter 
printed in black. 


АП words in black, no prompts. 


Cycle ten consisted of one page with all of the eight words printed together on 
the page in black. 


Cycle eleven was the same as cycle ten except that the words were printed in 
a different order. Cycles ten and eleven were included as pre-tests and post- 
tests of reading ability. 


A 
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words. None of the letters in any ofthe words ever 
appeared in anything but black. This version was 
tried out on two six-year-old children. These child- 
ren could read all eight of the words at the end of 
the program. The program was giventothese child- 
ren approximately six weeks after they had started 
first grade, where they had had some exposure to 
color words. 

A modified version of the program wasthen pre- 
pared which was similar to the one described in 
Table I except that this version did not use colored 
letters in cycles seven and eight, i.e., the frames 
in cycles seven and eight were simply unprompted 
criterion frames printed in black letters. The in- 
itial tryout of this version was with six first-grade 
children selected by their teacher from a reading 
readiness group. These children were considered 
slow learners and their I.Q.’s ranged between 75 
and 90. The children's reading knowledge of the 
color words before taking the program was almost 
non-existent despite previous specialtraining inthe 
words in the reading readiness class. A few of 
the children could pick out the words “yellow”? and 
“тей,” As before, an adult sat with the children, 
turning pages and supplying reinforcement for cor- 
rect answers. Error rates were extremely low, but 
latencies were often long. Runningtimefor a. single 
child was between four and six minutes for the total 
program of 74 frames. After finishingthe program, 
three of the six children could read all eight of the 
words and two could read sixwords without prompt- 
ing. One child was excused early in the program 
because of lack of attention and general negativity. 

The version of the program reported in Table I 
was tried out in November of the school year with 
teacher-selected kindergarten children. The teach- 
er was asked for above-average children. Three 
boys and three girls went through the program this 
time. Two of the boys learned all eight wor ds at 
the end of the session, and one boy and two of the 
girls were unable to read any of the terminal be- 
havior words. There appeared to be а good rela- 
tionship between the child's ability to identify the 
colors readily, a behavior which most с hildren 
learn in kindergarten, and the success of the pro- 
gram. The two girls who failed to learn any of the 
Words had considerable trouble in naming the colors 
on sight at the beginning of the program. None of 
the kindergarten children could read any of the 
words at the beginning of the program. 

Approximately 15 children have been run at var- 
ious times since the kindergarten tryout. These 
subsequent tryouts have used students with differing 
amounts of experience with words and letters. These 
children have been in the five and a half to six and 
а half year-old range. The results obtained lead to 
the conclusion that the program functions as an ad- 
equate teaching device when, and only when, the 
control stimulus to be transferredor extended from 
(a color) elicits the naming responseto be attached 
tothe new stimulus. The child should be able to 


identify all eight colors promptly, with very little 
or no confusion between colors. This is the initial 
required repertoire for whichthe program was con- 
structed. 

A further point to be made isthatinthe course of 
the usual kindergarten and first grade Schooling, the 
age range at which the studentis ready for the color 
names program varies a great deal, and the period 
during which the program has optimal utility seems 
to be rather short for a given child. It is also 
worth noting that the program works best when in- 
dividually administered by an adult who supplies 
prompt and consistent reinforcement or feedback. 
The program, however, might be usable with a 
group of students as ‘‘seat work." It seemed de- 
sirable for the student to have a copy of the program 
for some period of time after taking it if he showed 
any interest in practicing on his own. Another ver- 
sion of the program which wasfoundextremely use- 
fulis one which splits the eight color words into 
groups of four as described earlier. 


Discussion 


The color names reading program as it has been 
described is an example of how learning to sight- 
read may be programmed according to certain ba- 
sic notions of learning theory. This Should not be 
taken to mean that sight reading is recommended as 
a method of teaching a large reading vocabulary. 
Indeed, other kinds of training suchas phonics 
seem to have much more to recommendthem, since 
such training provides a student with behaviors that 
facilitate learning new words on his own. Asa 
small first experience in reading, however, some 
sight reading training provides the student with an 
additional experience with words whichtakes amin- 
imum of learning time and often gives the student 
an encouraging contact with reading. Many teach- 
ers, infact, make a point of teaching color names 
early in first grade because they are easy to teach 
and are useful as directions with workbooks that 
require coloring. 

The discriminative transfer technique using lit- 
eral or behaviorally equivalent prompts can be gen- 
eralized to a variety of subject matters. Such a 
programming procedure could be used wherever it 
is desirable to attach or transfer an existing set of 
discriminations to a new set of stimuli. Using the 
literal prompt system, a pre liminary number 
names reading program for numbers up to 100 has 
been constructed. Children in first grade usually 
learn first to count, then to verbalize the numbers 
in response to the sight of the arabic numerals, 
and finally to read the printed number words. Such 
sequential acquisitions serve as the basis for se- 
quential transfer programs. An early frame from 
a number reading might look like the small figure 
at the top of the next page. 

Much of elementary reading training could be 
carried on with the simple paradigm of the literal 
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prompt discriminative transfer program. Words 
such as ‘‘dog,’’ “саб,” and *house" all evoke vo- 
cal responses when the real objects, or pictorial 
representations of real objects, are present. These 
representations offer vanishable prompts for read- 
ing programs. Verbs, too, can often be prompted 
in a variety of ways; for example, a series of hu- 
man stick figures engaged in various activities such 
as sitting, running, and eating, could be used as 
prompts in association withthewritten verb. 
Through instructions, the child taking such a verb 
program could be given a *'set" to responding with 
verbs. Such stick figure prompting has, in fact, 
been employed in language texts (2). Discrimina- 
tive transfer programming offers а start or de- 
Scribes one aspect of reading programming, but, it 
must be pointed out, by no means offers the com- 
plete answer. 

Other areas suitable for discriminative transfer 
programming include adult foreign language train- 
ing, shorthand training, and learning of code sym- 
bols and abbreviations. Тһе technique has already 
been used with a beginning electricity program de- 
signed to teach recognition of the symbols used in 
circuit diagrams. Іп general, the discriminative 
transfer procedure permits the extension of re- 
sponses from non-verbal to verbal stimuli or from 
verbal to non-verbal stimuli, depending upon which 
has been established previously. 

The color names program could easily be put 
into machine format. In a simple mechanical ma- 
chine, the student could respond to each frame by 
putting a check above, or a circle around, one of 
eight colored dots. After the student responds to a 
frame, the correct answer, revealed by alever- 
push, could be shown as a dot of the correct color. 
Automating the program would probably not insure 
that a correct spoken response becomes attached to 
each word, but there is little reason to think that it 
would fail to occur if the required initial repertoire 
were present and if the material were careful ly 
programmed. More elaborate machines, of course, 
could be constructed that would permit the use of 
a push button and other techniques of fading and 
vanishing. 

When the color names program is used in book- 
let form, the teacher functions asa page turner and 

reinforcement mediator. A short demonstration 
program such as this could be used by parents and 
beginning teachers as a training device which pro- 
vides a ‘‘feel’’ for programming and for the use of 
reinforcement. 


The teaching of reading by the use of such obvi- 
our prompts as color and form may seem like a 
mechanical process. Even if achild should learn to 
read the words taught in the “Learning Color 
Names’’ program, the teacher might be well justi- 
fied in asking if the child ‘‘really understands" the 
words he responds to. In one sense, the answer to 
such a question is ‘‘no,’’ because it is true that a 
child who has used thhs program may not be able to 
read color names when he sees them printed in dif- 
ferent scripts, nor will he ‘‘understand’’ that green 
is a product of the mixture of blue and yellow. 
However, if such understanding is desired, it should 
be behaviorally defined and brought into the pro- 
grammed sequence. 
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Pennsylvania) and to Mrs. Bertha Seaman of 
Overlook Elementary School(Baldwin-Whitehall 
Schools, Pittsburgh, Pennsylvania). These 
principals generously permitted and encouraged 
program testing in their schools, 

2. Now at Western Reserve University, Cleveland, 
Ohio. 

3. The version of the program reported here was 
developed as a research tool, and this report 
is intended primarily to describe the design, 
development, and early tryout of the program 
rather than to report on any large-scale class- 
room application. 
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The Ruleg System for the Construction of Programmed 


Verbal Learning Sequences" 


THE RULEG SYSTEM for the construction of 
programmed learning sequences is based on the 
premise that the verbal subject material which 
appears in a program can be grouped into two 
classes of statements which may be called ru's 
(for “гшев”) and ев’ (for **examples'"). Since 
learning to identify ru's and eg's is itself a prob- 
lem in concept formation, an attempt will be made 
to indicate the range of both ru's and eg's so as to 
show the distinction between the two concepts. The 
two concepts, interestingly, are not mutually ex- 
clusive. This “relativity” of ru's and eg's in 
which the same verbal or symbolic statement may 
Sometimes be both a ru and an eg has so far 
caused little difficulty. On the contrary, it has 
proved useful in constructing programs. 

No one has yet discovered a completely satis- 
factory definition of a ru, nor are the authors con- 
vinced at this stage that one is needed. A ru may 
be a large number of things: a definition (opera- 
tional or otherwise), a mathematical formula, an 
empirical law, a principle, axiom, postulate, Or 
hypothesis. The invariant feature of all pU 
that they are all statements of some generality, 
from which substitution-instances can be obtained. 
These substitution-instances are called eg's. 

An eg may also be a large number of things: а 
description of a physical event, а theorem or de- 
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duction of any sort, or a statement of relationship 
obtaining between specific objects, whether the ob- 
jects are physical or conceptual. The invariant 
feature of all eg's is that they are all statements 
of some specificity, derived from more generlized 
ru's. 

The clearest instances of ru's and their corre- 
sponding eg's come from mathematics. For exam- 
ple, the algebraic ча+р= + а” 1з а ги, which 
summarizes compactly а number of substitution- 
instances, ог ер’ such as “7 + 9= 2 + 7’ and 
«3.44 8.6 = 8.6 43.4." Again, the statement 
“Unsupported objects will fall toward the earth" 
is a ru, while “If I release my pencil,it will fall" 
would be an eg, as would “‘If I release my book, it 
will fall." From optics comes the ru that “Тһе 
angle of incidence is equal to the angle of reflec- 
tion." To exemplify, one may create eg’s such as 
«Jf the angle of incidence is 30°, the angle of re- 
flection is equal to 309.” Of course, these ru’s 
would need qualification to become acceptable, but 
the point should be clear that ru’s involve some 
generality, while eg’s involve specificity, albeit a 
relative specificity. That is, 342 = 2 + 3’ is in 
turn a ru for such eg’s as “3 stones + 2 stones = 
2 stones + 3 stones’’ and “3 mammoths + 2 mam- 
moths = 2 mammoths + 3 mammoths.'' The rela- 
tionship of the ru-eg concept to the preparation of 
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programmed learning sequences and the steps in 
the construction of a program are now described. 


Step 1. Specification of Criterion Behavior 


At this step the programmer must attempt to 
outline, as precisely as possible, both the respons- 
es he wants from the student at the end of the pro- 
gram, and the stimuli or cues in the presence of 
which the student will be expected to make these 
responses. Questions such as the following must 
be asked and answered. If the student is studying 
statistics, will he have to produce the formula for 
the standard deviation on his own, or will he al- 
ways һауе a book available? Is he being prepared 
to write a short essay comparing two statistical 
tests or will he be called upon to take a multiple- 
choice test at the end of the program? In general, 
with what stimulus support may himself while the 
instructor is assessing criterion behavior? Can 
the student use à book? Another student? The in- 
Structor? His notes? A table of integrals? The 
construction and form of the program will differ 
radically as a function of the criterion behavior 
chosen. 


Step 2. Specification of Subject Matter Rules 


At this point, the programmer should proceed 
to write down all the subject matter rules he can 
think of. In specifying subject matter rules, the 
programmer should first work without the support 
of books, notes, charts or advice. All of these 
will be available later. For a subject matter ex- 
pert such “асадетпіс” objects usually set off pat 
behavioral sequences which, although relevant to 
thetask, may preclude an initial free-ranging 
overall examination of the field. If the program- 
mer uses stimulus supports, he is apt to neglect 
rules he might otherwise have thought of. Every 
rule generated during Step 2 should be written 
down, preferably on separate cards to permit easy 
rearrangement. 


Step 3. Collection of Stimulus Support in the Form 
of Texts, Notes, vice, etc. 

The programmer should examine these tools 
systematically for the ru's which are to be built 
into the students' repertoire. He should also put 
these rules on index cards. At times the Subject 


matter may yield only eg's, in which case the pro- 
grammer may have to induce the rule himself. 


Step 4. Preliminary Ordering of Ru's 


The ru's of a program must be presented in 
some order, but the conventional order in which 
they are presented in texts is not necessarily the 
best order for learning. Writing the ru's on sep- 


arate cards will facilitate experimentations with 
ordering. Useful methods for ordering rules in- 
clude complexity (simpler ru's first), chronology 
(ordering of ru's in time, as perhaps in a history 
program), spatiality (ordering ru's in space, as 
perhaps in a geography program), and dependence 
on other ru's (in a statistics program where it is 
necessary to introduce ru's about the mean before 
ru's about variance are introduced). Care should 
be taken, at this point, to avoid omitting rules or 
leaving gaps which the student may not be able to 
bridge. 


Step 5. The Ru Matrix 


The ru's should not be set up in matrix form 
with the same ru's listed vertically and horizon- 
tally like this: 


Ru 1 Ru 2 Ru 3 
Rul 


Ru 2 


Ru 3 


The use of a matrix is based on the fact that 
the hallmark of the expert in a subject matter is 
his ability to interrelate the concepts in his field. 
The ru matrix permits a systematic method for 
examining the ru’s of a subject singly, in ordered 
pairs, triplets, and so on. Ru’s can be checked 
for similarities, differences, possible confusion, 
or any of a host of possible intra-verbal connec- 
tions. The matrix may also have some heuristic 
value, in that it may prompt comparisions or re- 
lationships not previously considered, e.g., ‘‘How 
are the laws of thermodynamics related to the 
laws of optics?” **How is what we know about 
bread mold related to bacterial growth?" 

The upper left corner of the matrix is reserved 
for operators for inter-relating the ru’s. A very 
general operator is the relator operator. Consider 
a matrix like this: are 


RELATE Rul Ru 2 Ru 3 


Cell no. 4, for example, would ask, ‘‘How is ru 2 
related to ru 1?” This might require a different 
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handling than cell no. 2, which asks, “How is ru 1 
related to ru 2?” If the order of relationship is of 
no consequence, symmetrically placed cells, such 
as 2 and 4, 3 and 7, 6 and 8, are redundant. If or- 
dering does matter, the complete matrix permits 
an examination of every ordered pair. Such or - 
dered pairs may be added to the axes as new row 
and column entries, thus expanding the matrix to 
include ordered triplets, quadruplets, etc. For ex- 
ample, the programmer could firstrelate pressure 
to volume in the “‘perfect’’ gas equation, then put 
this relationship on the axes of the matrix, and have 
a new cell relating pressure and volume to tem- 
perature. 

The major diagonal relates each rule to itself, 
and the cells on this diagonal can be reserved for 
definitions. Ru's can be defined in terms of some 
previous behavior which can safely be assumed to 
exist at sufficient strength in the student to make 
the definitions meaningful. 

Another operator which might be termed the 
discriminator operator is also useful. This oper- 
tor would ask for a statement of how two rules are 
different. This would set the stage for discrimi- 
nation training. 


Step 6. The Example Operator in the Ru Matrix 
and the Construction of Examples or the 
Program. 


This is a critical feature of the Ruleg System. 
The eg's which are constructed must meet a num- 
ber of criteria. These аге: 


1. An adequate number of eg's must be 
generated for each ru. Itis mainly through eg’s 
that the student will interact with the subject mat- 
ter. Eg's must be generated not only to exemplify 
a ru initially, but to provide later practice and re- 
view for that ru. 


2. The full spectrum of eg’s for a given ru 
must be considered, i. e. , special cases, limiting 
cases, trivial cases, examples with inadequate in- 
formation, and examples with redundant informa- 
tion. As a good rule of thumb, the first eg fora 
given ru should be the simplest possible non- 
trivial example. For instance, or a student S 
first example of the computation of a variance, use 
an N of 2, with simple integer Scores which pro- 
duce a mean which is also an integer. Any temp- 
tation to be complex or devious should be resisted 
at this time. Complex “‘test items’? come later 
when the student's behavior has been adequately 
strengthened. 


3. To insure adequate generalization оға 
ru the attached eg's must be as diverse as possi- 
ble and still exemplify that ru. Also, to insure 
adequate discrimination between ru's, eg's must 


be selected which resemble each other as nearly 
as possible while still exemplifying their respec- 
tive ru's. 


Step 7. Numbering the Cells of the Original Ru 
Matrix to Indicate the Order in which the 
Ru's are to be Presented and Exemplif. Ted 


It is often useful to start by using the major di- 
agonal (the intersection of each ru with itself) as 
definition frame. After that, a decision can be 
made regarding what ru’s should be compared and 
contrasted, or what pairings of rules should be 
omitted. The programmer can now number in se- 
quence all cells to be dealt with. 


Step 8. Assembling the Ru's and Eg's into Frames 


A frame consists of the stimulus material 
available at any given moment which demands one 
or more responses on the part of the student. The 
student makes his response, and then receives 
immediate feedback as to the correctness or in- 
correctness of his response. Frames are con- 
structed by judicious selection and combination of 
ru's and eg's from the population of ru's and eg's 
which has been constructed. To provide for re- 
sponses on the part of the student, words,phrases, 
numbers, symbols and so on are omitted from 
certain ru's and certain eg's. Such omissions 
may be indicated with a line, like this: : 
For instance, the programmer could give ап іп- | 
complete ru in this manner: The angle of inci - 
dence is equal to the angle of , where 
“reflection’’ is to be provided by the student. As 
an example of an incomplete eg involving numbers, 
there might be: 8 - 32 3 + , with “8” being 
the appropriate response. The choice of the re- 
sponse to be made by the student is obviously an 
important matter. It often is easier for stylistic 
reasons to omit certain words from a frame than 
others; however, it is necessary to consider just 
what the response required by the student can ac- 
complish toward producing the criterion behavior. 
Does the response review or strengthen the re- 
sponses elicited in past frames? Does it help set 
him up for the next step? Does it “test” what he 
should have learned previously? 

To designate the incomplete ru’s and eg’s, the 
symbol “С,” (read tilde) can be added over the 
symbols for rules and examples, like this: ru 
(ru tilde) and SÉ (eg tilde). For terminal situa- 
tions in which criterion pehavior is being called 
for in the presence of minimal stimulus support, 
use two tildes over the ru and eg symbols, like 
this: ті, Е. An example of а would be: 

The machine formula for the Pearson-product- 

moment correlation coefficient is ` 
No other hints or aids would be available to the 
student. An example of an eg would be: 
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The variance of the following scores: 3, 7, 4, 
2, 9 is : 
Again, no other ru's or eg's which might Serve as 
prompts must be present, or we would have an eg, 
not an eg. 


Frame-Types 


A list of frame-types described in these terms 
is presented below. Frames of this kind have 
been found to be useful in program construction. 


1. Ru «* eg + eg. This frame-type is an ex- 
cellent method for the economical introduction of a 
new ru. The ru + eg + 6g can have the student 
working an example of a brand new rule on the very 
first frame in which he is exposed to it. By giving 
him an explicitly stated ru, and one (or more) 
carefully chosen eg's before calling for a response 
by means of an 6g, а powerful prompt is set up 
which makes an error in responding most unlikely. 
To illustrate a ru + eg + eg frame: 

To multiply by 100, move the decimal 

point two places to the right. (ru) 

For example, 100 x 2.843 = 284. 3 (eg) 

Also, 100 x 7. 374 = . (6E) 
It is best not to introduce more than one ru at a 
time, or to introduce complex ru's too early. The 
first example in many texts is often much too com- 
plicated. The primary consideration is that pre- 
requisite behavior must be adequately strengthened 
before the student can proceed. 

While a frame with ru ер + 6g is a most effec- 
tive method of presenting an initial frame for a 
new ru, other combinations are certainly possible. 
For example, ru 66 may be useful, but at least 
one completely worked-out example may be neces- 
Sary to permit the working of the first eg with ease 
and efficiency. Why not eg + @? This would be 
an analogyframe. The chief characteristic of an 
analogy is that one must induce the ru from the 
first eg, and then apply it to the eg. Rather than 
run the risk of having the student induce an incor- 
rect ru, it may be preferable to state the ru for 
him explicitly. Incidentally, this same philosophy 
may lead to the rejection, in general, of the induc- 
tive presentation which might be symbolized as: 
ебі + еб. . egy + ru. Неге a large number of 
eg's are given and the student is asked to state the 
ru involved. 

After the student can recognize and apply a ru 
with proficiency, then eg + 66 frames are accept- 
able. But until that time, it may be often hazard- 

ous and slow to approach a ru through induction 
and incidental learning. The student should adopt 
the expert programmer's carefully chosen state- 
ment of a ru rather than use his own halting induc- 
tion-derived statement. Examples in textbooks 
are typically too few in number or too restricted 
to indicate the full range of the rule in question. 


This makes it possible for a student to induce what 
is essentially an incorrect rule, but one whichhap- 
pens to fit all the examples present. This consti- 
tutes another possible source of danger of the in- 
duction process. 

After initial introduction to a ru through the ru4 
eg + eg formula, it is tempting here to write: ru + 
eg + eg + ёк, to indicate that опе or more complete 
examples can be used in the initial frame. Depend- 
ing оп the ru involved, it is often a good idea to pro- 
vide multiple complete eg's so the student will be- 
gin to generalize on the ru and not have the ru at- 
tached to only a small number of eg’s. A number 
of variations have proved useful for the subsequent 
frames. The following frame-types represent only 
a small number of the possible ru-eg combinations, 
but they appear to be useful in frame construction. 


2. Ru+Tu. Ru's typically contain technical- 
vocabulary words and terms. These are words 
which may be quite new to a student, e.g., “регі- 
gee” in astronautics, or they are familiar words 
with new meanings to be attached, e.g., “рорша- 
Чоп”? in statistics. Students often are slow in add- 
ing such words to their active vocabulary. If a 
technical-vocabulary word is defined in one frame, 
and then called for after subsequent frames, it has 
been a common experience that the student will of- 
ten fail to come up with the word. It is better 
technique to force attention to the technical-vocab- 
ulary terms by calling for them explicitly soon af- 
ter they are introduced. To illustrate: 

The Commutative Law permits the inter- 

change of members connected by a “+ ” 

Sign. (ru) So if you found a case where 

two numbers had been switched about a 

“© +?” sign you could be sure that the 
had been used. (fu) 

3. Ru+@. In later frames in a program, 
Stimulus support can be gradually withdrawn by 
giving only the ru before giving the 6g. Likewise, 
the previously mentioned analogy frame (eg + 6g) 
represents a weaker prompt for the eg than does 
ru + eg + eg. 


4. Есті. An induction frame which can be 
used more safely when the student has the ru at 
some strength. 


~ 
5: Еч] fü». Such frames can be used to 
compare and contrast two different ru's as part of 


the discrimination training involving the ru's. This 


сап obviously be extended to fü, + fu; + Tug and 
so on. 


6. Egy + . For comparing and contrast- 
ing examples о S different ru's, or for demon- 
strating how two ) ru’s can be applied in turn. Again 
the extension to €, + еро + Gq, etc., is obvious. 


(Note that frame-types 5 and 6 would be produced 
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by the matrix technique in the programmer's effort 
to deal systematically with the important intra- 
verbal connections which make up the subject 
matter. 


T. fg. This is the terminal frame of a 
series in which criterion behavior is performed 
in the presence of minimal cues. For example, 
the student might be asked to ‘‘Compute an analy- 
sis of variance on the following three groups of 
scores...’ or ‘Describe the acetylcholine 
cycle. 2 


8. Ru. Although the eg frame is more 
basic, in the sense that to respond appropriately 
to an eg, the student must have command of the 
ru or ru’s involved, on occasion the programmer 
may wish to call for a statement of a specific ru — 
to exemplify, **What is the second law of thermo- 
dynamics?" 


9. Eg. There is another concept in the 
ru-eg framework. This is the concept of the 
“false example" which is symbolized: ёр ( read 
“ере bar"). The rationale for the eg is this: Іп 
addition to the rich intra-verbal associations 
which the professionally competent person in some 
fields of learning has, he possesses an additional 
ability. This consists of his facility in detecting 
and correcting errors, even though he might never 
make those errors himself. Such **error-detect- 
ing" behavior can be taught like any other behav- 
ior, i.e., by giving the student a chance to prac- 
tice it and be reinforced for performing appropri- 
ately. Every field of knowledge has its own pecu- 
liar booby-traps and sources of confusion.. Rather 
than have the student fall into these errors, and 
rely on uncertain punishment effects to correct 
his behavior, another technique is to apprise him 
(the eg), and indicate that he is to correct it. 

Examination of the protocol following а pro- 
grammed learning session will help point up the 
sources of student error, and subsequent pro - 
grams can incorporate eg's at the appropriate 
spot. An example of an еб: 

In his homework а student had the following 

equation: 

3 (5 + 2) = (3 + 5) (342). But you know this 

is a mistake. 

The correct equation is: 3 (5 + 2) = 

Answer (3 х 5) + (3X 2) 

Another use of the eg is in getting the student to 
discriminate where and where not to use technical- 
vocabulary words. For example: 

Names of people and places are termed ; 

proper nouns. For example, “Chicago” is 

the name of a place, 50 itis a proper noun. 

** Tree?! is not the name of a person or 


place so itis nota | ^ -- 


Step 9. Assembling Frames of Various Types 
into a Program Using the Numbered Ru 
Matrix as a Prompt. 


Among the problems to be faced at this step are 
the following: How many frames should be used 
for each ru? How should these frames be distrib- 
uted throughout the program? The procedure rec- 
ommended here is to lead off with a ru + eg + 6f 
frame, and to terminate when the student can deal 
effectively with 8 frames (and ra frames). In be- 
tween, the general principle is to withdraw stimu- 
lus support “‘gradually,’’ with the use of such 
frame-types as ru + êg, eg + 96, and eg + fü as 
mentioned before. Actually, prior to the first ad- 
ministration of the program, the optimal number 
and distribution of frame-types within a sequence 
can only be estimated. 


Step 10. Tryout and ‘‘Item Analysis’’ of Results. 


Only after a frame-by-frame analysis of stu- 
dents’ recorded responses to the program can 
the effectiveness of a program be assessed. If 
the students are dealing effectively with 6, 
then the program has done its job and further 
modifications can be in the direction of making it 
more efficient without interfering with its effec- 
tiveness. If the student is not producing the spec- 
ified criterion behavior reliably, then steps must 
be taken to add more frames, redistribute the 
frames, clear up ambiguities, and so on. One of 
the chief strengths of the programming technique 
is its provision for analyzing and revising mate- 
rial on a detailed empirical basis, i.e., the re- 
sponses of the learner. 

Step 11. Revising the Program on the Basis of 
Students’ Responses and Comments. 

Students should be encouraged to record their 
comments and criticisms on particular frames as 
as they proceed through the program. After ini- 
tial administration of a program, a large propor- 
tion of the corrections will follow quite obviously 
from the nature of the errors made. Some will 
be a result of the simple typographical errors; 
others will be produced by easily-corrected am- 
biguities in the text. Failure to come up with à 
correct term or solution implies the necessity of 
additional prior strengthening of the concept in- 
volved. 

In the process of program revision, there are 
a number of booby-traps. The first is the danger 
of over-prompting. Ifa student cannot come up 
with the word “entropy,” the solution is not to 
give him “епігор ,” and have him guess the 
letter “y.” With adequate prompting, а jungle 
savage can get through a course in differential 
equations. It is easy to correct a program by 
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adding prompts, readministering it, and finding to 
one's delight that the errors drop practically to 
zero in the corrected frames. But the proof of the 
pudding is in the Ss; only if the student can pro- 
duce criterion behavior with minimal cueing has a 
revision succeeded. 

A second booby-trap is the assumption that the 
student will respond appropriately to supplemental 
instructions, comments, and material which ap- 
pear on the frame but are not involved in the actual 
production of the response to that frame, for exam- 
ple, “learn this,"' ‘‘read each frame carefully, 
etc. Such techniques are probably not without ef- 
fect, but they have a hard-to-control differential 
effect on students of varying compulsivity. Amore 
realistic rule-of-thumb is: The student will attend 
only to those stimuli which are necessary to pro- 

uce the correct response on any given frame. 
Exhortations to learn we not to make errors 
are tenuous sources of control of student behaviors; 
behavior important enough to be strengthened at all 
should be dealt with by evocation and reinforcement. 

The third booby-trap is that as a result of for- 
malizing a system of program writing, such as 
Ruleg, the responses of the student may fall under 
the control of the wrong set of stimuli. For exam- 
ple, if all technical-vocabulary words are under- 
lined the first time they are presented and always 
called for later on in the same frame, the produc- 
tion of this word might soon fall under control of 
the underline, rather than under control of the 
textual material present. On the other hand, for- 
malization is not without its benefits. Observa- 
tional evidence indicates that some sort of a 
“learning how-to-learn’’ phenomenon can take 


place. That is, once a student gets into the swing 
of the ru + eg + 66 formula, he begins to grasp new 
ru’s very quickly because he can begin to anticipate 
how the presentation of the rule will be developed, 
If he finds that he is always required to state each 
ru at some later frame (as іп a fü), this may lead 
him to attend more carefully to the ru as it is first 
presented. 


Step 12. Repeating the Administration and Revi - 
sion Procedures unti rogam is Pro- 
ucing Criterion Behavior Reliably an 
Efficiently. 


This is the last step in the Ruleg System of pro- 
gram construction. 

This early attempt to formalize and make explicit 
some of the rules of programming will be judged 
by the same criterion used to ascertain the effec- 
tiveness of a program, namely, by results. Like 
а program, it will be edited, revised, supple- 
mented, or discarded on the basis of the results it 
produces. In particular the authors would like to 
dispel any impression that they are irrevocably 
committed to a Ruleg approach, or that they feel 

Ruleg will solve all programming problems. The 
system contains a large number, of gaps and 
guesses. But in programming work it has been 
found to be a helpful prompting system and is an 
improvement over a more artistic manipulation 
of the subject matter. The Ruleg System also 
provides a way of communicating to others in an 
operational fashion, one of the ways in which pro- 
gram construction can be carried out. 
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Implications of Current Research and Future Trends" 


TO MAKE a useful and valid extrapolation of cur- 
rent research on programmed instruction is like 
parlaying a penny into a million dollars. In theory, 
this is possible, but the odds against itare certain- 
ly high. If one has to forecastfrom today’s findings 
to tomorrow’s activities and conclusions, the ob- 
vious conservative strategy is to make many pre- 
dictions in the hope that at least one obvious—and if 
lucky, a significant one—will be verified. 

A seemingly reasonable initial prediction is that 
auto-instructional programming and teaching ma- 
chines seem to be here to stay. 

While the possibility of their contribution to ed- 
ucational research seems clear and potentially sig- 
nigicant, their acceptance anduse in the classroom 
is another matter since they seem to be unaccept- 
able to many classroom teachers. This is suggest- 
ed by data obtained in a recent study by Professor 
Allen Barton of Columbia University's Bureau of 
Applied Social Research. Heasked anational sam- 
ple of elementary school teachersthe following 
questions: 

**Have you heard anything about teac hing ma- 
chines?' Yes No 

“If yes: From what source did you hear about 
them? What is your opinion of the possible effects 
of teaching machines 2?” 

Replies were received from about one third of 
the teachers; consequently, we can suspect bias in 
the sample, for those who replied would tend to be 
more motivated and probably better informed 
teachers than the average. 

Incomplete tabulations revealedthatover 60 per- 
cent had heard about teaching machines, and that 
approximately one fourth who had heard were ofthe 
opinion that they had favorable effects, but about 
the same number were undecided or had no opinion. 
Of those who had heard, two sources: а) ‘General 
reading, Т.У.” and b) ‘professional reading 
courses, lectures” were the two most frequently 
reported sources. The third source in frequency 
was “Other teachers, teachers’ organizations. a 

**A supplementary sample of better schools co- 


NENNEN  - .— 
* Footnotes will be found at the end of the article. 
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operating with the Associated Public School System 
gives a considerably lower rate who had not heard 
of teaching machines—about 22 percent—with about 
the same proportion of favorable and unfavorable. 

They had more often heard about them from other 
teachers and from professional courses. ’’ 

Some 61 to 69 percent of teachers sampled said 
they had heard about teaching machines, and the 
higher the grade taught, the more favorable was 
their reported reaction. 

The obvious prediction is that more teachers will 
hear about teaching machines. How this will come 
about is not certain, but a form of com munication 
that seems most promising as through teacher in- 
stitutes and small work and study groups. It would 
seem likely the notions would be presented to key 
teachers first and that they would supply informa- 
tion to their associates. Hopefully, they would also 
advise the school administrator about developments 
of use to the schools. 

Since only about half of the teachers heard about 
teaching machines, about one fifth of the total sam- 
ple had a favorable attitude toward them, it is quite 
apparent that their use atall, let alone correctly, 
is going to develop slowly at best, unless specific 
action is taken to catalyze the process. 

One problem that can be anticipated is getting 
teachers to accept, let alone use, programmed in- 
structional materials and t eac hing machines. 
Special institutes and regular courses will have to 
be organized to cope with it. An attitude study of 
seniors in an Educational Psychology class, (Stolu- 
row, 1962) revealed marked differences between 
their opinions and attitudes from those of students 
in an introductory course in psychology. For ex- 
ample, these two groups differed significantly in 
their opinion about the possible development of an 
objective science of behavior. The fledgling teach- 
ers did not feel that such a science could be devel- 
oped. They felt that psychological research which 
might allow people to controlothers would be unde- 
sirable. Furthermore, and closer to home, they 
felt that teaching machines were athreat to teachers 
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and would replace them. If fledgling teachers hold 
to these opinions, the future use of programmed in- 
Struction and teaching machines will be limited at 
least, and possibly prevented from becoming a 
Standard part of a school program. 

The need is for better dissemination of informa- 
tion and preferably at a face-to-face level. In fact, 
if programmed instruction and teaching machines 
are to be used extensively outside of the few major 
cities, then there has to be a systematic effort to 
teach both new and experienced teachers about 
teaching machines and programmed instruction. The 
prediction is that we will see an increase in these 
activities- workshops, institutes, audio-visual pro- 
grams, and the like, Teachers will need to be 
trained to use programmed materials and teaching 
machines, let alone to develop their own programs. 
In order to train the teachers, courses must be es- 
tablished in which they must study their capability 
and determine the proper place of machines and pro- 
grammed self-instruction inthe educational setting. 

My prediction, based upon these data, is that at 
both national and local levels, the initiative taken 
to teach teachers about teaching machines and pro- 
grammed instruction will be increased. A larger 
number of training institutes and demonstration pro- 
jects will be established in the next few years to do 
this with established teachers. Hopefully, colleges 

and universities with teacher training programs al- 
so will add more information to every new teacher's 
curriculum sothat successive classes willbe better 
informed and the need forspecial institutes lessened. 
Why are auto-instructional methods and machines 
here to stay? The reason seems to lie in the facts. 
The results produced,while tentative andthe studies 
often of poorer quality than we hope will be conduct- 
ed in the future, seem to be clear enough to makea 
prophecy. It is that these methods will be found 
useful, and therefore will be developed to the point 
where they have real staying power. Much of the 
initial research was conducted to show that these 
methods and materials modified student behavior in 
desired ways. Even though the ‘‘no difference" 
studies comparing programmed self-instruction 
with live teachers have been the rule rather than the 
exception, they do show that, in general, students 
learn. Whether or not the method used is called 
teaching is another problem. The results are clear 
that a student's knowledge can be increased in this 
way. Furthermore, they showthat increased know- 
ledge can be produced in a variety of different stu- 
dents, with a variety of different materials and with 
a variety of different procedures— program and ma- 
chine as the sole source of information; program- 
med materials used as a supplement to a regular 
course of instruction. Тһе variety of students who 
show these changes is broad and includes the men- 
tally retarded, normal and giftedchildren, the deaf, 
college undergraduates and graduate students, and 
adults working on job training situations. 
The most consistent finding is that students learn 


at least equally well compared with other met hods 
of instruction. Furthermore, they often learn in 
less time by means of auto-instructional methods. 
These studies and comparisons have emphasized, if 
not introduced, a concept into education which may 
be expected to play a more important role in thefu- 
ture thinking of educators and teachers—- namely, 
the concept of educational efficiency. This concept 
enters whenever one asks if a method can teach the 
same amount of information in less time. In the 
past, with each new teaching method, the edu- 


' cational research question has been “Does this 


method result in more learning than a traditional 
method?" However, the comparative study gener- 
ated by this question, the critical point overlooked 
is that the samples drawnto represent each method 
may not have been representative or typical of the 
method, so the study only provides data about the 
specific groups compared and not about the general 
question of which method, on theaverage, produces 
more learning. To answer this question, it is ne- 
cessary to sample the methods sufficiently to as- 
sume that the average effects are represented. 
Over and beyond this sampling problem is the ques- 
tion of efficiency since two methods could produce 
the same amount of learning, butone of them might 
do this in twenty minutes, and the other in one hour. 
Another possibility is that the method which pro- 
duces the greater average amount learned also 
might require more time to achieve this average, 
but the real question is efficiency and not amount. 
One feature of research on auto -instruction I 

hope will not be with us in the futureisthe compar- 
ative study. Currently in auto-instructionalre- 
search there is blind repetition, if not stubborn per- 
sistence in the use of the comparative methods of 
research design. Film and TV research have shown 
this same pattern, Interestingly, this type of study 
is inappropriate when we know very little about а 
complex phenomenon; yet it seems to be at its peak 
in its compelling quality at the very time it is least 
likely to be useful. With the com parative study, 
each of the specific conditions being compared is 
not the condition, but rather a sample of just one 
condition from a population of essentially infinite 
variation. Consequently, what is most needed in 
the beginning is the parametric study rather than 
the comparative study. We need to know what we 
are comparing in order to make a meaningful com- 
parison, and we don’t know what we are comparing 
unless we know what the population of variations is 
like. Once we know enough about the population of 
variations that we call auto-instructional pro- 
gramming, we may see why a comparative study 
could not possibly lead to results that would aid in 
making a useful decision. If we know anything, we 
know that a comparative study cannot produce gen- 
eralizable results. A repetition of such a study 
could produce just the opposite results, since the 
second time we might sample thetwo sets of condi- 
tions in such a way that the two we obtained for 
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comparison would, within their r e s pective popula- 
tion, be at opposite extremes of their respective 
sampling distributions from what was obtained in 
the first study. The real fact may be that the aver- 
age effect produced by each method is equal to that 
of the other, or that one produces a more homogen- 
eous set of effects than the other, but that their av- 
erage effect is the same. 

My prediction and firm hope is that the compar- 
ative study in which a teaching machine program is 
compared with live teaching will become extinct. 


Awareness of the problems with the ty pe of study ` 


seems to be growing, and the prevalence of the de- 
sign waning. 

The other side of this prediction is that in the 
future we will see more studies inwhichthe purpose 
is to determine the relative effectiveness of various 
methods, techniques or conditions of programmed 
instruction. Through systematic study of different 
programming methods, principles and conditions, 
it will become possible to indicate the important 
conditions that determine the effectiveness of а pro- 
gram and/or machine. 

The main staying quality of programmed in- 
struction that will be recognized more and more is 
its capability of controlling conditions which here- 
tofore it was not possible to control. With pro- 
grammed instruction and machines, it is possible 
to be quite explicit about either a m ethod or a 
teaching sequence. Added to this advantage is that 
of reproducibility of the conditions. They make it 
possible to study teaching itself in a way that we 
could not do in the past. Involved is the possibility 
of doing research on methods independently of the 
teacher's personality; later on we can study the 
methods when combined with different personalities 
to determine what happens to their effectiveness. 
While there has been considerable interest in this 
problem in the past, upto now, the capability for 
studying it did not exist. Since it does now exist, 
the prediction is that we will see studies of how 
these two sets of variables interact with one anoth- 
er. This will make a science of teaching a genuine 
possibility. 

This leads to the next prediction that the teach- 
ing machine will contribute to a theory of teaching. 
As a research laboratory, the teaching machine has 
the potentiality of providing the necessary controls 
for studying variables of teaching strategies, and 
so I predict that we will see one or more theories 
of teaching emerging in the near future with reliable 
and valid data to support them. Hopefully, teaching 
theory may become as respectable andactively pur- 
sued an objective as learning theory, in the not too 
distant future. pier: 

Related to this last prophecy is the pr ediction 
that we will see increasing efforts to revise and re- 
vamp more courses and curriculums, particularly 
atthe college level. The cudgel is being raised ЭЭ 
gainst traditional conceptions of the «structure of 
different subject matters. We need data to provide 


guidelines. The prediction is that the courses will 
be changed to produce cognitive structures. The 
subject matter expert who tries to program a course 
he feels he knows very wellalmost invariably comes 
to new, and apparently useful, conceptions about 
both the content and the methods of presenting it 
most effectively. Even if auto-instructional meth- 
ods and machines did nothing else than produce bet- 
ter texts than we have, as a result of reorganization 
of content, they will have more than compensated 
for the troubles and costs produced. 

One of the major changes that can be anticipated 
here is the clear recognition by the subject-matter 
expert that the content of acourse is specifiable and 
in fact, needs to be specified at a more detailed lev- 
el than is now customary if it is to be organized for 
efficient teaching. Related isthe prediction that the 
relationship between the content of an existant 
course and the possible content will be viewed in the 
future as the statistician views the relationship of a 
sample to a population. 

As things stand now, some auto -instructional 
programmers (not the subject-matter experts) con- 
sider a program as if it were a population or uni- 
verse rather than as a sample from a larger popu- 
lation. It would seem that the future portends for 
these people that they will discover sampling theory 
that they will consider a program asa sample from 
what might be better thought of as a very large, if 
not infinite, population of steps. 

In teaching arithmetic, for example, it is not 
possible to include steps containing all numbers, 
let alone all number combinations for teaching the 
operations, etc. Not only is the stimulus material 
such as the numbers an infinitely large population, 
but also the terminal behaviors are too numerous 
to teach in many cases. Consequently, the problem 
is to specify both the stimulus properties of the 
program steps and the responses— both the transi- 
tional or component, and the terminal, and to make 
explicit the sampling rules to select the materials 
used, 

Another predictable outcome is a better under- 
standing of the effects of sequencing ideas andin- 
formation when teaching. It is a strange paradox 
that we readily comprehend and use the notion that 
different physical routes tothe same goal are often 
very different in their cost, their difficulty, or 
their affective value, but have avoided studying the 
analogue in the teaching— the significance of organ- 
izational and sequential factors. One possible rea- 
son for the lack of data might be that the conditions 
required for the proper study of organizational fac- 
tors just were not available. Now that they are, 
the prediction is that this important question will 
be examined. 

This problem can be related tothe sampling no- 
tion mentioned earlier in thatthe programming can 
be seen as the application of a set of rules for or- 
dering knowledge by selecting from the population 
that set which we decide to use as the program con- 
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tent ina particular way. From this point of view, 
any teaching strategy can be translated into a set of 
rules for sequencing steps of a program. It is in- 

teresting to consider the possibility that two very 
different-sounding teaching strategies could never- 
theless generate the same sequence of steps. 

To pursue this somewhat more, let us assume 
that we have decided upon a content to teach which 
has two critical dimensions—the concepts and the 
illustrative examples. То represent this content, 
we can generate a two-dimensional matrix: Concepts 
X examples. If we think of the concepts as the rows 
of the matrix and the examples as columns, then 
each variation of these dimensions identifies a set 
of cells of the surface. Once this degree of speci- 
ficity has been achieved, a programming or teach- 
ing strategy can be translated into a sequential 
sampling rule discribing the order inwhichthe cells 
of the matrix were used. The ‘‘Ruleg’’ sytem of 
Evans, Glaser, and Homme (described earlier in 
this issue), for example, has the sampling rule 
equivalent to that of the deductive teaching strategy. 
The total content is specified as a set of rules and 
examples and these are then organized into a pro- 
gram so that following a rule is one or more exam- 
ples. However, it is not sufficient to specify just 
this order, for it also is necessary to specify the 
order in which the rules have to be programmed 

and the sequence for selection of examples. 

The question that arises now is what does this 
say with respect to the future. The answer is that 
auto-instructional methods will be used in the future 
to provide data for a theory of content sampling to 
achieve specific educational objectives. In other 
words, research on this problem willprobably show 
that there are many, not just one, sampling se- 
quences that are effective and that particular ones 
are better than others for a specific educational ob- 
jective. It seems reasonable to predict that se- 
quential arrangements will be found to make a dif- 
ference in the resulting cognitive structure of the 
Student which will be reflected in his behavior in 
ways that are educationally significant. 

Some data that relate to this prophecy were ob- 
tained by Cartwright and Stolurow?, who completed 
a study in which the same setof program steps was 
organized into two different sequences. Іп one se- 
quence, the numerical value of the fractions was 
ordered consecutively so that they went from lar- 
ger to smaller, or vice versa, in teaching a par- 
ticular concept. In the other Sequence, however, 
the same set of materials was presented so that the 
sequence was mixed up. Inother words, with the 

mixed sequence, the students could not ancitipate 
what fraction would come next, if he attended to 
either the denominator or numerator, while in the 
Sequence using the consecutive order, this was 
possible. To use this class cue, the student would 
have learned the natural order of the numerals. 
The use of a previously learned skill in this way is 
an example of the transfer of training in which the 


transferred skill mediates the new performance, 

The students in this study were high school age 
(educable mentally handicapped) students enrolled 
in a special education curriculum, Division of Spe- 
cial Services, Evanston Township High School, Еу- 
anston, Illinois. The study wasanextensionof ear- 
lier work (Detambel and Stolurow, 1956; Stolurow, 
1956; Wulff and Stolurow, 1957; and Stolurow and 
Bergum, 1958) concerned with the effects of par- 
ticular sequence differences on pe rfor mance in 
learning tasks. 

One of the earlier findingsinthis series was that 
the stimulus sequences which led to the most rapid 
learning did not produce scores that were correlat- 
ed with standardized test scores of general intel- 
lectual ability. Examination of the scores of the 
different ability groups on the achievement test 
(Detambel and Stolurow, 1956), revealed that the 
lowest ability group under the sequence that pro- 
duced the better average performance obtained 
mean scores that were equal to that of the highest 
ability group. However, the lowest ability group 
under the sequence that produced the lower mean 
Score obtained scores that were significantly lower 
than those of the highest ability group. In other 
words, the best sequence did for the lowest ability 
group what the highest ability groups could do for 
themselves regardless of sequence. Others have 
also found that the more effective program sequen- 
ces result in essentially zero correlation between 
achievement scores and MA or IQ. This does not 
mean that all of the intellectual requirements of a 
learning task can be eliminated by proper sequenc- 
ing. It merely says that if the students have the 
minimum abilityto master the task, then the most 
effective sequence will produce a set of achievement 
Scores that will not be correlated with general in- 
tellectual ability, since the sequence will have com- 
pensated for these differences. 

In another study, using a program to teach a 
sight vocabulary to EMH students at Lincoln State 
School (Stolurow, 1962), sequencingwas again found 
to be important. Here the study was concerned 


. With intra-step sequence and the ones used deter- 


mined differences in learning and in retention. A 
prompting sequence was uniformally better for 
learning under the two degrees of overlearning that 
were used, but a confirmation sequence was better 
than the prompting sequence for producing greater 
retention after the higher degree of overlearning. 
In the Evanston study, a different effect of se- 
quence differences was sought. It was to find out 
whether two sequences that resulted in comparable 
levels of achievement differed in the requirements 
placed upon the learner’s abilities. The group of 
40 students was divided into two sub-groups, each 
of which studied a different sequence of the identi- 
cal steps. Тһе only difference between the two 
programs was the sequence in which the pages were 
put together. The error rates were 7 and 12 per- 
cent for the two groups, and there were no signif- 
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icant differences in mean performance on the a- 
chievement test they took immediately upon comple- 
tion of the program. Thus, we can say that the two 
programs achieved the same educational outcome 
in terms of amount taught. But, did each do it by 
placing the same requirements upon the students? 
The answer is no. The two tables reveal some 
of the differences. 

For example, in Table I, the full-scale IQ (FSIQ) 
and total language correlated significantly (г = . 614 
and . 634, respectively) with the scores on the im- 
mediate post-test for students given the mixed se- 
quence program, but this score did not correlate 
with the immediate post-test scores of students giv- 
en the consecutive sequence program. On the other 
hand, the scores obtained on the consecutive se- 
quence program correlated significantly with read- 
ing comprehension and arithmetic fundamentals. 

With other measures of achievement, this same 
pattern was found, one set of scores correlated sig- 
nificantly with post-test scores but the other set did 
not. The consecutive sequence program seemed to 
require more specific abilities like arithmetic rea- 
soning and arithmetic fundamentals. The mixed 
sequence program, onthe other hand, seemed to 
require more general ability such as measured by 
MA and IQ. Table II provides additional corrobor- 
ative data based upon the 21-day delayed transfer 
test. 

These data augur the possibility of a truly indi- 
vidualized instruction, a topic about whichthere has 
been much talk but little research indicating just 
how one could implement the idea. These data, if 
supported by other findings, suggest that one way to 
individualize instruction would be to sequence a set 
of steps in such a way as to make maximum use of 
the individual’s abilities. Witha computer-based 
teaching machine, for example, the steps of a pro- 
gram could be stored and each student could then 
insert into the machine his ability profile. This in- 
formation could determine the sequence in which 
the computer would sequence the steps for him. 
This concept of programming is to be contrasted 
with linear (Skinner, 1958) and intrinsic program- 
ming (Crowder, 1960); it might be called ability 
pattern programming. 


Mechanisms 


With respect to the teaching machine of the fu- 
ture, I see two extremes with little in the middle. 
At one extreme will be the book formats—scram- 
bled text and programmed text. At the other ex- 
treme will be the large-scale computer-based sys- 
tems with multiple student stations. The more 
modest, mostly mechanical, machines which we see 
proliferating in today’s market will drop out of use 
in the school. Some of these less adaptive devices 
will continue to be used in laboratory research, 
new program development, or highly specialized 
work with small special groups. 


What seems to make this a likely possibility? 
Several quite different factors suggest it. Some 
are related to the teacher, others to the student, 
and still others to the established institutions which 
are most interested and sensitive to the economics 
of the different approaches. 

Cost. Enough of the small machines that control 
the exposure of paper, cards, slides, or filmed 
program could cost more than a computer-based 
system, and even if the cost were the same, the 
small machines would be more limited than the 
computer-based system. In the computer field, it 
is estimated that a 100 to one cost advantage may 
apply to the use of the large, fast machine, over the 
little one. 

These small mechanical teaching machines re- 
quire special formats for programmed materials, 
and these formats cannot be used inevery other ma- 
chine to get an adequate library for a school pro- 
gram in a different format from that needed for the 
available machines, which would have to be repro- 
duced with the necessary art work, etc. to fit the 
available machines. (What school would do this ?) 

Personnel requirements. The smaller machines 
need to be loaded and unloaded, They have to be 
set up for every new student, each period of the day. 
Every student has to be relocated inhis program at 
the point where he left off. This means that after 
the program is inserted, the place must be found 
and the recording tape reset. The teac her who 
does this for 25 to 50 students every 35 to 50 min- 
utes will find some way to escape from the situa- 
tion. Even figuring one minute per student, a con- 
servative estimate, these activities would add up to 
30 to 40 minutes per period per teacher in simple 
mechanical work. 

If the machine uses a program printed in paper 
scroll, rather than on separate sheets, then some- 
one has to make up the scroll. Usually this means 
attaching the separate pages together after mimeo- 
graphing them, and rolling them up for use. Once 
used, they need to be unrolled, Each scroll has to 
be individually handled to locate every student in 
the place where he left off at the last period. 

If teachers are to grade the students, they will 
have to examine the students' response records to 
decide about the students' progress. The other al- 
ternative is to give tests atthe end of each program 
unit. This latter procedure means that for each 
student at every period of the school day, the teach- 
er will have from 40 to 150 items per student to 
check before the next day. Also, if the time taken 
by the student is considered, then this is another 
datum to be processed each day. Put it all together, 
and it becomes a big job. Research workers don't 
mind doing these things. Infact, they are used to 
it; however, even they do not do it daily, year in 
and year out. Not many of today's teachers will do 
it. Consequently, it will be necessary to provide 
them with clerical assistance or а computer-based 
system that automates much of this work. 
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Both the book formats and the co m put er-based 
teaching machine systems can handle these problems 
with relative ease. With thecomputer, the program 
can be put on tape and fed in as needed for as many 
students as are scheduled at a particular time. 
Once the program is read into the computer, each 
student can ask for any problem. Within micro- 
seconds, he can have it available, provided no one 
is also asking for the same material at the very 
same time. If they are, however, he may have to 
waitonly a fraction of a second. Furthermore, as 
soon as he is finished with a program, the student 
can press a button and get a printed record of his 
performance already scored and the times record- 
ed. The teacher also can get these individual stu- 
dent records plus the collated record of any group 
of students to see how they are performing. 

With the new book formats, on the other hand, 
the students of the future would betold to read them 
and when finished, to take a test. These tests 
could be automatically scored by the usual methods 
if they were the objective type using printed answer 
sheets or cards. Also, the Pressey punch board 
could be used for immediate feedback to both stu- 
dent and teacher (Pressey, 1926). 

While some students may not follow the rules of 
thinking of the answer before looking, this is really 
no different from today's student who skims his as- 
signment and doesn't really study it. Both are 
really cheating themselves by not doing the things 
that would insure either better learning or better 
retention, or both. I don’t know why we should be 
so concerned about a student who looks ahead at an 
answer in a programmed text if we are not doing 
research on the material. After all, all the an- 
swers to questions are given the student who reads 
today's text-book. 

The critical differences between -today’s books 
and those that will be used tomorrow are not relat- 
ed to the giving of answers. They are differences 
in format and organization which control the se- 
quence of these experiences. One difference is the 
display. With the new programmed formats, ma- 
terial is broken down so that important points аге 
salient. Thus, both the search time is reduced and 
the chances of finding the essential information are 
assured. Тһе student is focused оп the critical 
terms. 

Differences between a prompting and a confirma- 
tion sequence contribute to the effectiveness of a 
program. Тһе method used by the student when he 
looks ahead to the answer without **guessing'' what 
it is first is prompting. In effect, the student is 
converting a confirmation to aprompting procedure 
by his procedure. Ав indicated earlier, the data 
on this problem suggest that in doing this, the stu- 
dent may learn more rapidly, but he may remember 

itless well (Stolurow, 1961). Thus, the book for- 
mat has the obvious disadvantage of poor control. 

Also, arguing for the increased use of the book 
formats is the fact that students say they like them. 


The data say they learn from them. Theobstacleto 
their use, if any, seems to be the teacher. He has 
to learn to like a book that is designed solely to 

teach. This means that he has no index to use and 
really cannot look up material іп it. It is not a ref- 
erence. Furthermore, looking at the page number 
does not tell him that John, who is on page 100, is 
ahead or behind Susy, who is at page 50. Both the 
scrambled page book and the zebra page program- 
med text make page position a poor index of prog- 


> ress. Even іп research, where we use separate 


answer sheets, estimates of student progress are 
difficult because the entries may be greatfor either 
of two reasons: a) rapid progress, or b) many er- 
rors. One has to examine the entries to distinguish 
these possibilities. 

For these reasons, two extreme forms of these 
new developments are likely to survive and remain 
in use in the future— the relatively “іпехрепвіуе” 
book and the relatively “ехрепвіуе” computer. 
These economic labels probably are misleading. It 
could well be that the computer is man's gift to his 
fellow taxpayers if a large number of students are 
being taught by it, for it might be more economical 
than the purchase of a large number of individual 
machines. Certainly programs ‘would be cheaper 
in large numbers to produce electronically, once a 
tape was made, than to produce in printed or filmed 
form, 

While the computer -based system offersthe pos- 
sibility of reducing cost in production of steps for 
individual students and in processing data, it is by 
no means an established system inthe field of auto- 
instruction. Furthermore, thereare many unsolv- 
ed problems in using a computer-based system,and 
until they are solved and systems are inuse, it will 
be necessary to live with the smaller devices and 
the new printed text formats. 

А final prediction is that students who do well 
on programmed instruction will tend to be those 
who are more original than those who do less well. 
In other words, it will be found that there will be a 
positive and significant relationship between per- 
formance оп tests of achievement following pro- 
grammed instruction and on problem-solving tasks 
that require originality for their solution. 

This prediction already has some support in 
data. Bass, Golden, Hatton, McHale, Suh, Davis, 
Morgan and Stolurow recently studied this problem 
using just a small group of gifted students (N = 20; 
CA 13 - 15 yrs. ; IQ 112 - 157) who participated in 
а problem-solving institute whichwe setup and ran 
this past summer. Their learning experience con- 
Sisted of self-instructional programmed materials 
in logic, mathematics and statistics over a вїх- 
week period of four to five days per week. These 
materials were deliberately selected because of 
their relevance to a wide Spectrum of problem- 
solving situations, so the results may not apply to 
other subject matters. 

The students were given three Guilford (1954) 
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CORRELATIONS WHICH WERE SIGNIFICANT WITH ONE PROGRAM SEQUENCE 


BUT NOT FOR THE OTHER (EVANSTON STUDY) 


Variables Correlated* 


T 


Consec. Mixed 


Program Program 
1. Immediate Post-test Full Seale IQ ‚614 
2. Immediate Post-test Reading Comprehension ‚511 
3. Immediate Post-test Arithmetic Fundamentals .T38** 
4. Immediate Post-test Total Language .634** 
5. Part Score MA .499 
6. Рагі 5соге Full Scale IQ . 605 
7. Percent of Maximum Gain MA .513 
8. Percent of Maximum Gain Full Scale IQ .605 
9. Percent of Maximum Gain Arithmetic Reasoning .531 
10. Percent of Maximum Gain Arithmetic Fundamentals .452 " 
11. Percent of Maximum Gain Total Arithmetic ‚509 
12. 21-Day-Delay Test (DDT) Arithmetic Fundamentals . 641** 
13. 21-Day-Delay Test Total Arithmetic . 602 
14, 21-Day-Delay Test Total Language . 543 
15. 21-Day-Delay Test Total Battery Grade Point .636** 
16. 21-DDT, Part A Reading Vocabulary .620%% 
17. 21-DDT, Part A Reading Comprehension .593** 
18. 21-DDT, Part A Total Reading . 634** 
19. 21-DDT, Part A Arithmetic Reasoning . 508 
20. 21-DDT, Part A Arithmetic Fundamentals ‚544 
21. 21-DDT, Part A Total Arithmetic .550 
22. 21-DDT, Part A Total Language .516 
23. 21-DDT, Part A Total Battery Grade Point .595 
24. 21-DDT, Part B Arithmetic Fundamentals ‚611 
25. 21-DDT, Part B Total Language .505 
* Immediate Post-test correlated with Part Score ‚549** 
Immediate Post-test correlated with Percent Gain .804** 
Immediate Post-test correlated with 21-Day Delay Test Total .372 
Immediate Post-test correlated with 21-Day Delay Test Part B В d 


Part Score correlated with 
жж Significant at . 01 level; unstarred values 


Percent Gain 


are significant at . 05 level. 
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type tests (Consequences, Unusual Uses, and Plot 
Titles) and two TAT cards withthe requirement that 
they make up stories about them which we then 
Scored for originality. In addition, many other 
Scores were obtained, e.g., IQ, grade point, etc. 
The Guilford originality scores correlated sig- 
nificantly with the statistics unit end-tests of a- 
chievement given after each of the parts of tne total 
program, and with the final examinationcovering 
the entire statistics program. Тһе originality 
Scores also correlated significantly with the logic 
tests given after completion of that program. The 
TAT originality scores correlated significantly with 
the mathematics final examination. Оп the other 
hand, IQ did not correlate significantly with any of 
these scores based upon programmed learning. 
Furthermore, as Getzels and Jackson (1959) found, 
the correlations between originality and IQ were 
' essentially zero. 

We think of these findings in relation to some of 
our earlier research results ontraining for trouble 
shooting and diagnostic! decision-making. In that 
work we identified three basic types of complex 
linkage paradigms that seemed to be present in a 
wide variety of complex learning situations, includ- 
ing the three areas of instruction used in the pro- 
grams given to our summer institute students. To 
the extent that auto-instructional materials require 
the student to build up specific, but multiple, asso- 
ciation linkages, then success in learning this ma- 
terial is going to be related to originality or diver- 
gent thinking, which is operationally defined as the 
ability to make specific, but multiple, associations 
to a stimulus such as “Ноу many different uses 
can you think of for a brick?"4 

Presumably, an effective auto-instructional pro- 
gram will build up specific, multiple linkages and 
to the extent that it is effective in doing this, it 
should be well established. Thus, if good perform- 
ance on the achievement test is dependent upon the 
formation of specific multiple linkages, then we 
could expect the predicted correlation. In other 
words, to the extent that a program builds what 
Skinner has called a ‘‘web of associations,” it 
Should provide the basis for making associative re- 
Sponses along several, rather than just one dimen- 
sion. If it does this, then the individual should be 
able to make many different responses to а given 
stimulus. 1f faced with a problem-solving task in- 
volving the use of remote associations, the learn- 
ing of multiple associations could be useful partic- 

ularly if we think of each one as cue produc ing. 
This would mean that the student would be able to 
produce more cues for his own responses, and in 
this sense, be more likely to generate the remote 
associations required for a difficult problem. This, 
of course, is speculatiee, but does provide a hypo- 
thetical base for the prediction and for generating 
additional research, some of which is currently in 
process. 


Summary 


With reluctance, but with the hopethatthey might 
be interesting, if not useful, eleven predictions 
were made about the future of auto -instructional 
methods and devices. They are: 


1. These methods and devices are here to stay. 

2. Severalthings will be done to acquaint teach- 
ers with the potentiality of these developments. 

3, The comparative study of live and automated 


` teaching will stop. 


4. Future research will concern itself with dis- 
covering the important characteristics of the ma- 
terials and methods. 

5. These developments will lead to a theory of 
teaching. 

6. Courses will be revised as a result of the 
new insights provided. 

7. Basic changes in our thinking about what a 
course is will take place. 

8. Research will demonstrate the importance of 
sequence factors. 

9. We will see a new form of dynamic individu- 
alized programming called ability -pattern pro- 
gramming. 

10. The devices of the future willbe either books 
(programmed or scrambled) or computer-based 
machines; small devices will drop out. 

1l. Learning from auto-instructional program- 
ming will be shown to be capable of aiding persons 
to solve problems creatively. 


FOOTNOTES 


1. Presented at the First Midwestern Conference 
on Teaching Machines and Programmed In- 
Struction at Purdue University, LaFayette, 
Indiana, November 17, 1961. 

2. Personal Communication of October 27, 1961. 

3. Unpublished research obtained in a study con- 
ducted under Cooperative Research Project, 
Contract No. SAE 8370, U. S. Department of 
Health, Education and Welfare, Office of Edu- 
cation. 

4. ONR sponsored Research. 
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Computer-Based Teaching Machines 


COMPUTER-BASED teaching machines are now 
Serving as research tools in a number of public 
and private institutions. The scope of this re- 
search in computer-ordered auto-instructional pro- 
grams was definitively circumscribed at a winter 
conference on the application of computers to auto- 
mated instruction held in Washington, D.C. ** 
(Many of the papers presented at this conference 
are now published in the book, Programmed Learn- 
ing and Computer-Based Instruction, edited by Dr. 
John Coulson and published by John Wiley and Sons, 
May 1962.) The significance of this work to the 
planning of future educational systems should not 
be overlooked, for it is very possible that the com- 
puters used for data processing and for informa- 
tion retrieval in our schools will also adapt auto- 
instructional programs to individual students’ 
needs and abilities in the not too distant future. It 
is because of the promise of this research and its 
relevancy to advanced educational planning that 
several of the invited papers presented at the 
Washington computer conference are here selected 
and abstracted for this special issue of The Journal 
of Educational Research. Before exploring these 
research reports, Owever, several introductory 
statements are needed. 


Machine Responsiveness to Student Learning 
Behavior 


The potential of the computer for handling indi- 
vidual student differences in learning rate, back- 
ground, and aptitude is of primary interest to 
most researchers working with computer - con- 
trolled automated teaching systems. Machine re- 
sponsiveness to student learning behavior can be 
achieved by branching the student forward, later- 


DON D. BUSHNELL 


Santa Monica, California* 


ally, or backward through subject materials pri- 
marily on the basis of his response to content ques- 
tions seeded throughout the instructional program. 
Criteria for branching can be many and varied 
(sometimes depending upon the sophistication of 
the equipment and the computer programmer), but 
generally will include one or more of the following: 


l. Characteristics of student response — the 
promptness and/or definitiveness of his 
reply. 

2. Nature of response — was it right or 
wrong, what specific errors were com- 
mitted by the student. 

3. History of student learning behavior — his 
previous response pattern, problem areas, 
and reading rate. 

4. Relevant student personal data — his IQ, 
Sex, personality, aptitudes. 

5. Nature of subject matter. 

6. Degree of student motivation. 

7. Student-generated requests for re-routing. 


The term ‘‘flexibility’’ has two applications in 
connection with the computer-based teaching sys- 
tem. First, the machine is capable of modifying 
its own mode of instruction during the course of a 
training session. Item Sequence, knowledge of re- 
sults and other such variables depend on the fac- 
tors listed above as well as on the instructions 
programmed in the machine. It is this type of re- 
Sponsive self-modification which significantly dis- 
tinguishes this experimental machine from the 
myriad varieties of teaching devices already іп ех- 
istence. Conceivably this unique characteristic 
may lead to a major improvement over other con- 
ventional and automated instructional techniques in 


* Center for Research in System Development, System Development Corporation, 2500 Colorado Avenue. 
** Conference on Applications of Digital Computers to Automated Instruction, October 1961. Cosponsored 
by the Office of Naval Research and the System Development Corporation. 
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terms of teaching effectiveness. 

A second type of flexibility, of great importance 
to any teaching machine intended as a research 
tool, pertains to the ease with whichthe machine's 
operation can be altered between training sessions. 
Such alterations are necessary to permit the inves- 
tigation of different training procedures or machine 
characteristics. 

The ‘‘responsive’’ experimental teaching ma- 


chine demands a considerable degree of complexity 


in the machine control unit. The only type of con- 
trol unit which provides the necessary flexibility is 
the digital computer, because the computer is the 
only instrument capable of determining the item se- 
quence and knowledge of results to be presented to 
the student as well as carrying out the bookkeeping 
activities upon which such determinations will be 
based. 

Inputs from the student are usually introduced 
by means of a typewriter keyboard connected to the 
computer control unit. These inputs consist of the 
student’s responses to instructional items, or, in 
some cases, of requests for further information 
from the computer. 

Two general types of stimulus material are usu- 
ally presented by the computer-controlled machine 
to the student. The first type, consisting of the 
actual instructional items, can be displayed by a 
slide projector or TV display acting under the di- 
rection of the computer control unit. Signals from 
the computer control the sequence of items to be 
presented. 

The second type of computer-based teaching ma- 
chine output consists of evaluative information con- 
cerning the student’s performance on the instruc- 
tional items. This information is often printed out 
on the page printer of the same typewriter used for 
student inputs. 

This brief description of the essential elements 
of an automated teaching system controlled by the 
digital computer does not, of course, hold for the 
specific configurations of components used in the 
research described below. It should also be 
stressed that the tools of this research are of less 
significance than the purpose for which the tools 
have been devised, a point sometimes overshadowed 
by the luster of the computer. 


Abstracts of Papers Presented at the Conference 
on the Application of Digital Computers to Auto- 
mated Instruction 


University of Illinois. “pLATO I: А 
Multiple-Student Computer-Controlled 
Automatic Teaching Device, ? paper pre- 
sented by W. W. Lichtenberger from the 
Coordinated Science Laboratory. 


PLATO (Programmed Logic for Automatic 
Teaching Operations) is the name of a research 


project using an ILLIAC computer-controlled sys- 
tem of slides, TV display, and student response 
panel for simultaneously instructing two students 
in either mathematics or French. The uniqueness 
of the PLATO II system lies in its input flexibility 
and the degree of control the student has over the 
sequences of educational materials he sees for 
study. When questioned on the content of material 
viewed on the TV screen, the student can transmit 
his answers in numerals, words, sentences, ог 
algebraic expressions. The ILLIAC displays each 
character selected by the student in the blank 
space appearing with the question on the television 
screen. By pressing a 'judge' button, the student 
causes the computer to write either “ОК” or “Хо” 
next to his answer. If he is wrong, the student has 
the capability of asking for additional help to which 
the computer responds by selecting simpler but 
related material until the student presses the 
* Aha! button and is returnedto the question missed. 
PLATO II, although fashioned after the model of 
the human tutor, has only a limited degree of 
adaptability to student requests. If the additional 
‘help’ routines of three or four slides are not suf- 
ficient to clear up problem areas, the student is 
given the correct answer and allowed to proceed. 
However, richer variable routines are apparently 
indesign. Some means of branching on the basis 
of (1) specific errors made in a wrong response, 
(2) the number of wrong responses submitted for a 
given concept, and (3) how much help was required 
to complete a given item are also being explored. 
According to an announcement by the Coordi- 
nated Science Laboratory, PLATO will ultimately 
be modified to teach many students at the same 
time. 


Bolt, Beranek, and Newman, Inc. 
‘preliminary Experiments in Comput- 
er— Aided Teaching, " paper presented 
by J. C. Licklider. 


A Digital Equipment Corporation, PDB-1 com- 
puter has been programmed to control two type- 
writer stations for instruction in language vocabu- 
lary drill. The computer types out a word from à 
setof German words. The student responds by 
typing a corresponding word in English. If the re- 
sponse is correct, the computer confirms its cor- 
rectness, posts a hypothetical score, gives acom- 
pliment, eliminates the word from the set, and 
presents a new word from that set. If the student 
has been incorrect, the computer says ‘No, > moves 
the item to a missed set, and offers him the choice 
of trying again or seeing the correct answer. When 
the student has finally replied appropriately, the 
computer posts a score, makes a suitable remark, 
and presents the next item in the sequence or а 
missed item from an earlier set. 

This research uses the computer to simulate 
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only limited machine responsiveness to errors 
committed by students. Licklider contends that 
concern with the hardware (displays, controls and 
procedures) is more effective in facilitating learn- 
ing than adherence to certain learning principles 
derived from the psychological laboratories. 


Thompson Ramo Wooldridge, Inc. 
*Computer Techniques in Instruction,” 
paper presented by R. L. Chapman. 


Thompson Ramo Wooldridge has developed an 
auto-instructional device with a control system 
which is smaller and less expensive than ageneral- 
purpose computer but which incorporates a large 
amount of operational flexibility. This device, 
called MENTOR, provides both visual and auditory 
display capabilities for training one student at a 
time. The student responds through push buttons 
and is given immediate knowledge of results. MEN- 
TOR can branch on the basis of single questions, 
cumulative performance scores, or response time 
latency. It keeps a record of frame sequences, 
student responses, and student response times. 

One prototype of MENTOR has been built for re- 
search. However, no research with this unit has 
been accomplished to date, although a long-term 
program of activities has been drawn up. 


System Development Corporation. 
“Computer-Based Educational Sys- 


tems,” paper presented by J.E. Coulson. 


Results of studies on variable-sequence methods 
of automated instruction were briefly summarized. 
These experiments were performed with a Bendix 
G-15 controlled instructional system using а ran- 
dom-access slide projector for presentation of ma- 
terials and an electric typewriter for student re- 
Sponses and feedback messages. Various branch- 
ing techniques have been studied with Secondary 
School and junior college students acting as exper- 
imental subjects. Optimum sequences of materials 
have been sought for individual students by branch- 
ing students on the basis of specific errors, errors 
over a series of diagnostic questions, student re- 
quests, and latency measurements. 

In a recently completed educational laboratory, 
called CLASS, a larger transistorized computer, 
the Philco 2000, is used to provide high-speed 

data-processing assistance to many parts of asim- 
ulated educational system. Individualized auto- 
mated instruction, using variable Sequence proce- 
dures worked out with the the Bendix G-15 System, 
is given simultaneously to 20 students. Other in- 
Structional techniques such as closed-circuit TV, 

films, and conventional lecture, can be combined 
with automated instruction in CLASS to determine 
the most effective educational methods for different 
Subject areas and different types of students. 


CLASS also allows teachers, counselors andad- 
ministrators to call up computer-generated dis- 
plays of current student performance and previous 
student records. Budgetary, logistical and sched- 
uling information can be used in making decisions 
during simulated training problems for adminis- 
trators. 


International Business Machines Cor- 
poration. “Оп Conversational Inter- 
action, "' paper presented by W. R. Uttal. 


Using the model of conversational interaction 
between the student and his tutor, the computer 
Scientists and subject matter specialists working 
together at the IBM Research Center have devel- 
oped an IBM 650 RAMAC system as a research ve- 
hicle. Experiences encountered in the preparation 
of a course in German, a course in stenotypy, and 
а course in statistics were presented. In general, 
the interaction between the student and the teach- 
ing machine proceeds in the following manner: 


1. The word, formula, or phrase to be encoded 
Or learned is presented to the student. Ac- 
cording to his record of experiences with 
this problem, it may appear with appropri- 
ate cues. 

2. The student keys his response into the key- 
board. When he has released all keys, the 
information is entered into the computer. 

3. The computer evaluates his answer and in- 
dicates whether he was correct or incorrect. 

4. Anew item is presented to the student or 
the same item is repeated. Either contin- 
gency is determined by the branching logic 
in the computer. $ 

5. After a number of these iterhs are presented, 
a lesson is considered complete and the stu- 
dent is queried to determine if he wants to 
Continue. During that inquiry, the computer 
is assembling a new set of items from the 
list of new problems or from the list of prob- 
lems which have been answered correctly as 
well as items from the error buckets (a set 
of memory locations which store the address- 
ев of items which have been missed a given 
number of times). 


It was emphasized by Dr. Uttal that all three 
courses taught by the RAMAC system were inearly 
Stages of development and actual research using 
the materials was yet to be accomplished. He feels; 
however, that a satisfactory theoretical and exper- 
imental foundation for teaching machines may not 
be laid down until computer-based instructional 


systems are an accomplished fact in real school 
Systems, 


Carnegie Institute of Technology. “Тһе 
Uses of the Digital Computer in a 


t 


à 
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Freshman Program,"' paper presented 
by A.J. Perlis, Director, Computation 
Center. 


At Carnegie Tech., students are learning to 
program a computer as well as a machine simu- 
lated on it through an algebraic and list-processing 
language. In the study of computer organizations, 
students are informed by text and problems stored 
within the computer of the nature of this organiza- 
tion and how it relates to that of a simulated ma- 
chine. Their progress in understanding is meas- 
ured by programs within the computer, which com- 
pare the student's answers to those of previously 
provided standards. 


Through the simulation of the machine on ama- 
chine, the general properties of programming are 
explained. Each property is described by text, 
tested by problems, and graded for understanding 
by the computer. 


According to Dr. Perlis, the purpose of this 
ongoing programming course is to describe cer- 
tain programming principles: parametrization of 
processes, iteration and cycles, recursion, oper- 
ational procedures of definition, representations 
and languages, mechanical communication between 
processes, and simulation. 


The Future School System and Computer Technology 


Recognizing that the most practical and imme- 
diate need for a computer in school systems will 
be to automate paperwork and bookkeeping func- 
tions, the research in computer-based instruction- 
al devices and the use of computers in instruction 
portends a broader application in school systems 
than the computer's more prosaic functions might 
lead one to expect. The evidence for this projec- 
tion lies in the work described above and in other 
recent developments in computer technology: 


1. Information retrieval systems, i.e., auto- 
mated library services utilizing abstracting 
and translating machines. 

2. Computer programs for aiding management 
and decision-making activities in schools 
paralleling those in business which supply 
periodic economic forecasts, balance budg- 
ets, and plan financial strategies. 

3. New techniques for processing educational 
data such as real-time registration and 
Scheduling programs, counselor and teacher 
data displays, automated diagnostic inter- 
views, and testing routines. 


The application of the digital computer to edu- 
cational functions holds the potential of an import- 
ant technological breakthrough in education. 
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DOES CROSS-GRADE GROUPING IMPROVE READING INSTRUCTION? Ramsey eval- 
uated the so-called Joplin planas practiced in Logansport, Indiana, and conclud- 
ed that the system seems to have merit, particularly for those students in the 
upper third in intelligence. However, forthose pupils inthe lower third in intel- 
igence, the system did not produce gains as great as expected, In general, 
teachers, children, and parents all looked with favor on the program. 


*BEFORE WE ADOPT MACHINES AND PROGRAMMED 
TEACHING as an integral part of the teaching process, it would be 
well that we turn to educational researchfor good answers,” Ander- 
son and Edwards say. Evenasthe American public school is chang- 
ing rapidly to meet the demands placed upon it by the twin explosions 
in population and knowledge, the writers are impressed by the 
sparsity of the evidence adduced which would either affirm or deny 
the efficacy of many of the large experimental programs. 


HAVE THE FACTORS AFFECTING SUCCESS IN SCHOOL CHANGED MUC H IN THE 
PAST 20 YEARS? No says Dugan after examining 105 tenth-grade biology stu- 
dents in New Jersey. Low IQ, lack of family educational tradition, emotional 
difficulties, shallow interest, and uninspired instruction are still responsible 

* for drop-outs, despite economic prosperity, modifications in the school curric- 
ulum, and increased school attendance. Why these factors are persistent and 
what to do about them will require more research, the author suggests. 

аг 


ART, ARITHMETIC, AND READING аге the subjects ‘‘signifi- 
cantly preferred" by elementary school pupils in California, Green- 
blatt reports. The social studies occupy a nebulous middle ground, 
neither preferred nor rejected. Boys show a special liking for sci- 
ence, girls for music. Pupils with an IQ of 110 or higher tend to 


prefer the same subjects, and these subjects are those preferred by 
teachers. 


better penmen than those southpaws who approach from the left side of the paper 
and more or less “һоок” the wrist while writing, Enstrom says. 


THREE UCLA PSYCHOLOGISTS offer a new method for evalu- 
ating the effectiveness of treat ment of learning difficulties. Built 
around pre- and post-tests, the system permits the assessment of 
an individual pupil's progress, and can also be used as a research 


LEFT-HANDED WRITERS who keep the writing hand below the writing line tend to be 
tool in relating personality, task, and procedure variables. 


A TEACHING-MACHINE COURSE IN SPELLING did not stack up very well in Penn- 
sylvania in comparison with conventional methods, Buzby and Mann report. The 
TMI-Grolier **Fundamentals of Spelling" program presentedon Min/Max Teach- 
ing Machines did not appear to hold any advantages over ordinary classroom or 


flash-card methods. 


WHAT DO STUDENTS SEE AS FACTORS AFFECTING TEST 
PERFORMANCE? Working with undergraduates at LSU, Gaier ob- 
tained these results: 1) High-grade students associate test success 
with activities on their own benalf more often than do low-grade stu-  - 
dents, 2) test success or failure attributed to the instructor is more 
frequent in “А?” students than in “D” students, and 3) in attributing 
test success or failure to external factors, low-grade students in- 
troduce outside influences approximately three times more often than 


high-performance students. 
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The Educational Process and Programmed Instruction 


THE CHANGES WHICH have and are taking 
place in the United States can be ascertained by the 
most casual observer. Тһе tempo of change has 
accelerated within our own lifetimes, and asthe 
mere business ofliving from day to day becomes 
more complex, countless new demands are made 
upon our schools. We һауе experienced an expo- 
nential growth in the production of radios since the 
crystal sets of 1923 and of passenger cars since 
1905. We are now experiencing perhaps an even 
more rapid growth with regard to television. Un- 
doubtedly the number of jet-passenger miles has 
increased greatly during the past three or four years. 
What we learn from this is that the development of 
an industry from a zero point to a point of near- 
saturation covers a period of time smaller than 
that during which any youth is required to attend 
School. Thus, each individual emerges from School 
into a world whose technological development and 
political and social organization differs consider- 
ably from that ofthe world into which he was born. 
The implications of this last statement for educa - 
tion are horrendous, for accompanying the growth 
in population is the equally frightening explosion of 
knowledge. Thus, to the two great problems in 
modern education, quantity and quality,there seem 
to be no easy answers. 

Since the dawn of the space age on October 4, 
1957, many critics of American education, no 
longer earthbound, contendthat we must choose be- 
tween educating a small segment of our population 
exceedingly welland trying to educate the remaining 
mass less well. Thisis an assumption that has not 
as yet been proved; and yet, American education is 
urgently in need of re - examination and improve - 
ment and we have been reluctant to make the hard 
decisions necessary to galvanize us into action. 
This has stemmed primarily froma lack of psycho- 
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logical ownership or a shortage of beliefs to which 
all schooling can be tied. Until the people have psy- 
chological ownership of the schools, they cannot 
know what they want education to do. 

And now, tomake the picture even more complex, 
conferences are being heldto considerthe potential 
for learning of a new arrival on the educational scene, 
a newcomer which may inthe immediate years ahead 
trigger the mass production of teaching materials 
for individualized instruction comparable in sheer 
magnitude to the growth we have experienced rela- 
tive to the production of radio and television sets. 
This newcomer is improperly termed a teaching 
machine which does not teach. It simply brings a 
potential learner in contact with materialit presents 
or that which was programmed by a person or team 
of persons. Skinner has this to say: 


The machine itself, of course, does not 
teach. It simply brings the student into con - 
tact with the person who composed the material 
it presents. Itisa labor saving device because 
it can bring one programmer into contact with 
an indefinite number of students. This may sug- 
gest mass production, but the effect upon each 
student is surprisingly like that of a private tu- 
tor. The comparison holds in several respects: 
(a) Thereis a constant interchange between pro- 
gram and student. Unlike lectures, textbooks, 
and the usual audio-visual aids, the machine in- 
duces sustained activity. The student is always 
alert and busy. (b) Like a good tutor the ma- 
chine insists that agiven pointbe thoroughly un- 
derstood, either frame by frame or set by set, 
before the student moves on. Lectures, text- 
books, and their mechanized equivalents, on the 
other hand, proceed without making sure that 
the student understands and easily leave him 


538 THE JOURNAL OF EDUCATIONAL RESEARCH 


behind. (c) Like a good tutor the machine pre- 
sents just that material for which the student is 
ready. It asks him totake only that step which 
he is at the moment best equipped and most 
likely to take. (d)Like askillful tutor the m a- 
chine helps the student to come up with the right 
answer. It doesthis inpart through the orderly 
construction of the program and in part with 
techniques of hinting, prompting, suggesting, 
and so on, derived from an analysis of verbal 
behavior. (e) Lastly, of course, the machine, 
like the private tutor, reinforces the student for 
every correct response, using this immediate 
feedback not only to shape his behavior most 
efficiently but to maintain itin strength in 
a manner which the layman would describe as 
‘holding the student's interest. ” 1% 


Before we let this intruder, and we use the term 
advisedly, gain a firm foothold in the schools, we 
Should examine its credentials in terms of the ob- 
jectives we hope to achieve. We should evaluate its 
performance before we make ita member of the 
teaching team. We should research it out to see if 
the claims made relative to its efficiency in the 
learning process are authentic. 

Itis important at this point to ask the question: 
Is not the central and underlying purpose of research 
in learning the improvement of instruction? If this 
be the primary purpose, thenthe real problem is the 
evaluation of the outcomes of instruction via teach- 
ing machines and programmed learning. Whatare 
some important objectives or criteria of learning? 
These may be stated in many ways, but are not the 
following almost all-inclusive? Has the learner: 


l. Acquired and retained useful and pertinent 
information of a factual nature? 

2. Acquired and retaineda wo rkable under- 
standing of the principles orbig ideas of an 
academic area? 

3. Learned to useintelligent methods іп адарі- 
ing to the problems of his life? 

4. Reached a level of understanding, applica- 
tion and performance in the above three 
which is commensurate with his ability? 


Regardless of whether or not you subscribe to the 
above criteria in terms of acontinuum or Spectrum, 
it seems reasonable to ask the question: Where 
does the teaching machine and programmed learning 
fit in the above scheme ofthings? Will thenewcomer 
Operate best with factual information and less well 
with principles? What does research tell us about 
the effectiveness of complete courses on film or the 
wise selection and purposeful use of films in the in- 
structional process? In two studies 2; 3 designed to 


= E 
* Footnotes will be found at the end of the article. 


evaluate the effectiveness of the Encyclopaedia Bri- 
tannica Physics and Chemistry Films, essentially 
negative results were obtained inthat the non-film 
groups did as well if not better than the film groups 
during a period of one year of instruction. However, 
in an earlier study, 4 the teachers and students were 
made aware of the principles of biological science 
covered or stressed in each film. The focus of the 
instruction was on the principles and the teaching, 
therefore, was purposeful. The results, discounting 


“апу **Hawthorne Effect, ’’ would seem to indicate that 


when the instruction via films is purposeful and 
planned in terms of objectives to be reached, posi- 
tive increments in learning occur. One might also 
say that films should not replace the regular mode 
of instruction or the role playedby the instructor. 
Teaching with or without films, television, or ma- 
chines, will be effective only as we employ 
teachers who know their subject matter and have 
the know-how to put the material across. А good 
teacher may greatly augment his effectiveness by 
the proper use of mechanical aids. This means 
selection of good films, good television programs, 
and good teaching-machine programs, which fit in- 
to a plannedand purposeful program of instruction, 
the objectives of which are known by both the 
teacher and herstudents and which are above the 
fact-knowing level or which are challenging to both 
parties in the instructional process. Without these 
considerations, the process of learning is nobetter 
off than it was before day of mechanical aids. 

Despite these considerations, we have a new - 
comer on the educational scene and in all fairness 
we must give the devil his due. Before we do so, 
what is meant by programmed learning or teaching? 
Edgar Dale describes it as follows: 


Programmed teaching is a method of self - 
instruction which has these characteristics: 

1. Ithas been developed chiefly in those 
fields of learning where the basic elements are 
predictable and systematic. 

2. The instructional material is organized 
into steps, called frames. Themostdesirable 
Size of these steps is debated by learning the- 
Orists. 

3. The student reads andtests himself on 
each of these progressive Steps by an overt 
writing response, by checking an answer, or 
pressing a key. The correct answer is then 
exposed or indicated, enabling the student to get 
immediate knowledge of the correct or incor- 
rect response. Good programmers may vary 
the stimuli, present a Slightly different item 
for one missed. 

4. One theory of programming holds that 
programs should be so carefully written that the 
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learner makes only correct responses. Some 
theorists believe that there is value in learning 
“to detect an inaccurate ог inappropriate item.” 
5. The student proceeds at his own speed. 
Review items may be ‘‘seeded’’ throughout the 
material to make sure that what has been 
‘learned’ is reinforced at appropriate times. 


How effective is programmed teaching? The an- 


swers will be slow in coming. An examination of , 


the literature revealed that fewer than ten studies 
appeared each year during the period 1948 to 1957. 
In 1958, about 35 teaching machine studies appeared 
in various periodicals and magazines and about 55 
appeared іп 1959. “Тһе major portion of the pub- 
lished material deals with the theoretical aspects of 
the use of automation in teaching in which various 
writers and researchers, mostly experimental psy - 
chologists, discuss the advantages anddisadvantages 
of the use of machines. А second, butsmaller, por- 
tion of the literature reports research on the learn- 
ing process andonthe theoretical issues of program- 
ming. In this category the research is concerned 
with whether or not students can learn with the use 
of machines and deals with questions such as mode 
of response, size of step, branching, and learning 
sequences.?® Important as these researches are, the 
category of research of most significance to prac- 
ticing educators is even smaller in quantity. This 
is research which attempts to answer the question: 
Does programmed teaching via machines result in 
greater learning than do the conventional methods 
now employed? Until this question is answered with 
concrete evidence based on good research, we 
should proceed with caution. Good research is based 
on three essentials: (1) organized knowledge, (2) 
observation and experiment, and (3) theory. In 
education, it is a most difficult task to achieve а 
blending of these essentials in the proper proportions 


since the area of endeavorisa complex of many things. 


Its level of theory developmentis low and its breadth 
of coverage is extremely wide. These in combina- 
tion have led to an “© arm-chair ”’ approach to the 
problems of education by many self-appointed ex- 
perts in and outside the field whose repertoire is 
lacking in one or more or all of these three essen- 
tials. Experts, otherwise well-equipped, һауе 
avoided observation and experimentation as a source 
of truth due to the remarkable breadth and complex- 
ity of educational measurement. It is important in 
setting forth the plans of an experiment to answer 
the questions which prompted the research and to 
list all of the variables that might conceivably influ- 
ence the results. Pertinent to this statementy we 
must have good research upon whichto make admin- 
istrative decisions about using programmed teach- 
ing in partor all of the academic program. Answers 
to several issues or questions must be forthcoming 
before we jump on the bandwagon. Edwards | has 
raised such issues and questions as the following: 


The first of these issues deals with the prob- 
lem of economics, a straightforward question 
as to whether the gains in learning (if any) are 
sufficient in terms of capital outlay both for 
machines and programs for machines. In the 
public schools, already harassed by financial 
problems, this question cannot be dismissed. 
There must be evidence supplied that the chan- 
neling of funds intothe new area yields suffi- 
cient return... 

There are problems of administrative econ- 
omy other than the monetary one, as well. 
Chief among these, tome, is one of adaptabil- 
ity. School people mustask questions such as 
these: Аге the machines limited іп their use to 
a specific group? Are the programs available 
useful for the general case and therefore lim- 
ited in their usefulness with the extremes? 
Are changes possible in programs by specific 
teachers, and under what conditions? Can 
programs be constructed by individual teach- 
ers, or by some individual responsible for such 
a process in the school system? Do the pro- 
grams apply to processes and facts only, or do 
they also reflect problem- solving and general - 
ization processes? 

...The less adaptable administratively, the 
weaker the case for using machines... 

The question of usability is also economic 
in nature. The principal concern here is of 
application to specific courses or kinds of 
courses and tothe school level or grades with 
which programmed teaching might be used. 
Administratively, if machines are adaptable 
only for certain courses (suchas mathematics 
and science), and not for other courses (such 
as English and history), their utility may be 
limited. At the same time,the more grade 
levels with which the machines may be used and 
for which programs are available, the better 
the case, administratively, which may be made 
for their inclusion in curricular planning. 

A final question in this area which I would 
raise deals with the function of machines in 
academic programs. In any course, orse- 
ries of courses, there are academic goals and 
objectives which are established. These goals 
are both short-term and long-range in nature. 
School personnel are interested in immediate 
effects through incorporation of knowledge. Ав 
an educational procedure, this is necessary 
but not sufficient. There arelong-range goals, 
reflected in examining immediate knowledge 
through perspective to its applicability to many 
areas. The question must be raised: Are we 
insuring high levels of immediate knowledge 
to some detriment of future use? 

Edwards summarizes his administrative con- 
siderations as follows: 
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Essentially, the questions I have raised 
from an administrative standpoint шау be re- 
duced to one of general scope: In comparison 
with current achievements, and lack of them, 
academically, how will programmed instruc- 
tion measure up? If itallows a better job, 
then school personnel must incorporate ma - 
chines in the curriculum. If not, programmed 
learning has little to offer. 8 


It might be well atthis point to consider some of 
the psychological aspects of programmed teaching. 
Should it provide for differences in intelligence? Up 
until now, teaching machine programs have.generally 
been written with little consideration for different 
levels of intellectualability. Experimental evidence 
on the subject is meager. However, Porter? found 
no relationship betweenIQ and achievement in Spell - 
ing. Briggs and Вевпаг410 Suggestthat when more 
complex materials are involved, several programs 
might be desirable for different ability levels. 
Shay, 11 in studying the relationship of intelligence 
to step sizeonateaching machine program, obtained 
essentially negative results for a fourth grade unit 
in roman numerals. Saltzman makes the following 
statement: " 


Until there is evidence which indicates 
otherwise, I suggest that the programer for- 
get about the problem of individual differences 
while he is in the process of developing his 
program. He should aim his program at the 
“average’’ or slightly below average student 
and develop only one form of the program. 
Later, when the program is tested, he may 
be pleasantly surprisedtolearn that thesame 
program is versatile enough to be used satis- 
factorily not only with average students, for 
whom it was prepared, but also with slow and 
bright students, the slow Students taking more 
time and the bright students, less time, to 
complete the program than the average stu- 
dents. Individual differences may turn out to 
be only a very minor concern for rogram- 
ers of self-instructional courses. 18 


Provision for differences іп intelligence is but 
one aspect of the problem. In a comprehensive 
Study of variability of exceptional high school sen- 
iors сі Science and other academic areas, Ander- 
воп19 and others found considerable overlapping in 
achievement of low and high groups based on intel- 
ligence test scores. Тһе findings Suggest that an 
acceptable criterion for exceptional performance in 
Science and other academic areas must be Sought 
outside the province of intelligence. Apparently 
there are forces other than intelligence at work 
leading to exceptional achievement such as motiva- 
tion, originality and creativity. If this be true and 
iflinear programming assumes that students begin 


a program with fairly equal backgrounds in a sub- 
ject such as chemistry, thendifferentprograms are 
in order for students with different backgrounds in 
Science. The alternative to linear programming is 
branching for students of differing backgrounds. 

In connection with programmed teaching, it 
might be well to keep the following in mind: 


If the responses to be learned are suf- 
ficiently simple that a high proportion of the 
group can master them during the period of 
learning, the variability of the group becomes 
less; but if the task is complex, involving the 
higher mental processes to the extent that the 
abilities of the most apt members of the group 
are taxed during the period of learning, then 
the variability of the group increases. When 
learning is structured and meaningful and in- 
volves unlimited goals, and when the abilities 
of all members of a group are taxed, individ- 
ual differences increase during the period of 
learning. The conclusion seems justified that 
the emphasis on the process of striving for 
limited goals, homogeneity of achievement, 
and getting all pupils over the passing mark 
tends to encourage teachers to set limited 
goals for instruction which results in tempo- 
rary factual learning. Our conclusion must 
be that the more effective the instruction — 
the more adequately we meet the needs of all 
pupils — the more heterogeneous groups be- 
Пан ТТІ ПІЛ 7 лу 


Students do differ in intelligence and in previous 
training for a particular Subject; and yet some stu- 
dents, despite low rankings on these variables, do 
achieve more than those students whocome to apar- 
ticular class better equipped. The answer to this 
apparent over-achievement probably revolves about 


such factors as motivation and interest. Edwards 
has this to say: 


Immediate purpose currently is used by 
teachers to involve the learner temporarily - 
the teacher does not depend solely upon this. 
New purposes must be established as the 
learning Proceeds, and this purpose tends to 
become more general, broader, more inclu- 
Sive of large-scale instructional goals. The 
question arises as to whether programs cur- 
rently available or to be available will contain 
the same Possibilities. A program can assure 
Sequence, success, immediate learning goals. 

» If restricted to this, the teacher must plan and 
teach still in sucha Way as to expand purpose 
and keep it active. 

As with purpose in learning, this goal of 
Permanent interests will still depend upon 
teacher interaction with students. It is really 
not the purpose nor the scope of machines to 
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accomplish such goals as these. This, then, 
is not a requirement of programming. It il - 
lustrates the fact that teachers will still be 
faced with many of the same problems using 
machines as they are today and in certain ways 
these problems will be intensified. 

All motivational elements are not outside 
the sphere of influence of programmed ma- 
chines, however. As a matter of fact, some 
of the strongest points to be made for machines 


lie in reward and information-giving. There 2 


is great appeal to the fact thata learner knows 
immediately whether his performance is cor- 
rect or incorrect. Whenthis responseis cor- 
rect, the reinforcement proves beneficial to the 
learning. Thisis probably the strongest point 
to be made for incremental steps in the pro- 
gram, sincethere must bea very low percent- 
age of incorrect responses. Forteachers who 
subscribe to a reinforcement theory of learn- 
ing, this point alone makes the programmed 
learning approach worthwhile. There isa 
point to be made, however, for the fact that 
learning can occur through failures as well as 
successes. 


Ina вїшау!б cited previously with regard to the 
Encyclopaedia Britannica Chemistry Films, it was 
found that: 


When gains were compared on the Anderson 
Chemistry Test from the pre-test to the mid- 
year test and from the mid-year test to the 
post-test, there was some indication that the 
non-film groups were somewhat superior and 
that the film groups suffered a greater drop in 
achievement during the last half of the year 
than did the non-film groups. The students ' 
reaction would seem to indicate that too many 
films were shown and that boredom had set 
in. This may account for the film groups’ 
greater drop inmeangain during the last half 
of the year. 


Edwards has this commentto make about interest: 


* 

The second element concerns interest. 
Most learners are attracted by the novel, the 
new, without regard really to learning mate- 
rial itself. Such appeal to interestsalso may 
act as a wedge tolearning andis utilized fre- 
quently by teachers. There is no doubt that 
teaching machines, when first used in aclass- 
room, will have such appeal. AS а result, 


increments in learning may be Shown superior 
to that found in the regular classroom proce- 
dures. But novelty loses its appeal. Under 

rements inlearn- 


such circumstances, the inc 5 
ing will disappear. Teaching machines mus 


demonstrate more than just immediate 


appeal. 17 


In addition to motivation and interest, elements 
more specific to learning should be considered rel- 
ative to programmed teaching suchas rate of learn - 
ing, practice, review, retention, and transfer and 
generalization. ‘‘Teaching machines, tobe accept- 
ed educationally, must provide for a transfer pro- 
cess, or it must be explicitly stated that teachers 
must still make provisions for transfer. iid 

Before we adopt machines and program med 
teaching as an integral part of the teaching process, 
it would be well that we turnto educational research 
for good answers. We must look for well-designed 
studies that consider many of the variables dis- 
cussed previously. Appropriate to the topic is the 
following: 


Іп an experiment based on the assumption 
= of controlling all factors except the one under 
investigation, it is often observed that results 
will change from one experiment to another 
of the same kind. If all the essential condi- 
tions are varied simultaneously rather than 
one at a time, one can observe the effects of 
the factors in a nearly natural setting. Fac- 
torial design is appropriate because the effects 
of the interactions of all combinations of fac- 
tors under consideration are measured. . . 
The chief advantages of factorial experiments 
are: (1) greater efficiency, (2)greater 
comprehensiveness in that effects and inter- 
actions are estimated, and (3) that the conclu- 
sions have a wider inductive basis. 

Itis possible to partial out ina factorial 
design, such as the above, the effects of pre- 
science information and intelligence. This 
would call for à factorial design involving 
analysis of covariance. 

Consideration of designs of the factorial 
type before the investigation gets underway 
might make the research efforts more fruitful. 

...the research worker in science edu- 
cation should: (1) use adequate and reliable 
statistics when the research calls for de- 
scription, (2) use statistical techniques prop- 
erly by testing assumptions basic to the tech- 
niques, and (3) select anduse statistical tools 
in the early stages of an investigation. Іп ad- 

hworker in science educa- 
[ new statistical tools 
utions available for exact 


science education по 
significance but interms 


mation. 
Finally, the science researcher should use 


the powerful tool of analysis of variance and 
covariance to polster the controlled experi- 
ment in science education, and insofar as 
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possible consider in future studies the possi- 
bilities of varying all the essential conditions 
simultaneously by designs of the factorial type 
So that our findings will reflect natural set- 
tings and thus have wider applicability in our 
Science teaching. When this becomes an ac- 
complished fact, science teaching via realis- 
tic research will improve immensely. 19 


The American public school is changing rapidly 
to meet the demands placed upon it by the twin ex- 
plosions in populationandknowledge. Major efforts 
to influence the scienceand mathematics curricula 
by all levels of government, by professional socie- 
ties, by institutions of higher learning, and by an 
impressive array of National committees are mani- 
fest on every hand. This changing educational cli- 
mate has been brought about by a complex of factors 
including the threatto national survival, a maturing 
Scientific and technological society, and a reluctant 
abandonment of a traditional national provincialism. 

** Although the changes in science and mathemat- 
ics, both actual and proposed, are of major propor- 
tions, this fact has not producedany pronounced 
shifts inthe nature or amount of the related educa- 
tional research produced. Infact, oneis impressed 
by the sparsity of the evidence adduced which would 
either affirm or deny the efficacy of many of the large 
experimental programs. Thereis obviously a need 
for a more thoroughgoing consideration of evalua- 
tive techniques and procedures and for the planning 
of comprehensive educational research efforts as 
part of these experimental programs which commit 

vast sums of money and the time of millions 
of pupils. 

It is within this flux that programmed teaching 
must find a place in the public schools. It might 
well be that programming will prove to be quite 
Mmi. to changes in course content and curricu — 

um, 

We who are responsible for the education of 
American youth must explore every possibility to 
improve the educational process to the end that a 
course of action is forthcoming which willlead to 
greatness for the American education System in the 
last half of. this twentieth century. To do so is to 
Survive. To do otherwise is to perish. 
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An Investigation of the Personal, Social, Educational, and Economical 
Reasons for Success and Lack of Success in School as 
Expressed by 105 Tenth Grade Biology Students 


Purpose and Need for the Study 


MORE THAN two decades have passed since ed- 
ucators began to study setiously the reasons why 
students drop out of highschool before gradu- 
ation. 179* In Spite of the increased enrollmentin 
the secondary school due to the rise in the birth rate; 
the great prosperity of the “Fabulous Fifties”; and 
the wider range of curriculum offerings in our Amer- 
ican schools, the steady rate® of drop-outs in the 
tenth and eleventh year is an indication that the prob- 
lems involved here still have not been solved. Each 
teacher in the public high schools who has soph- 
omores and juniors has his share of pupils who are 
“waiting it out until they are sixteen” or until pa- 
rental resistance to leaving school can be overcome. 
Each of these boys andgirls represents a failure on 
the partof the schools to meet students needs. Some- 
where along the line, the school has lost them and 
while there are Sporadic spurts of interest and mo- 
tivation, ultimately they flounder and give up. 

Bell? Studied 10, 858 students who had left school 
before graduation, andthe following are the reasons 
they gave for leaving school. 


Reasons Given Ғы ouk Т 
Economic reasons 54.0 
Lack of family funds 34.1 
Desire to earn own money 15.7 
Needed to work at home 4.2 
Lack of interest in school 24.6 
Lack of interest 20.6 
Disciplinary trouble 2.2 
Subjects too difficult 1.8 
Feeling of completion be- 
fore graduation 13.2 
Poor health 3.2 
To marry 3.0 
Other Reasons 2.0 


ЖАП footnotes will be found at end of article. 
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Twenty-two thousand (22,000) school drop-outs 
were studied by the Bureau of Labor Statistics in 
1958 and the investigation was reported by Wolfbein,8 
The table on the next page shows reasons for leaving 
as listed by students and by school records.9.. ё 

Teachers and guidance directors questioning boys 
and girls who are leaving school today are aware 
that their reasons for leaving are very much the 
Same as they were twenty yearsago. It would seem 
that there is a great necessity for educators to ex- 
amine and re-examine factors affecting the success 
of youth inschool. If these factors have not changed, 
then it is necessary to investigate why theseare 
factors controlling success or lack of success in 
School. 

The purpose of this study is to investigate the 
personal, social, educational, and economic reasons 
for success and lack of success in school as 
expressed by 105 tenth-grade biology students. 


Specific Problems 


Three specific problems must be studied to find 
the solution to the main problem stated in Purpose 
and Need for the Study. 

Problem One: To study subjective opinions of high 
school sophomores as to factors determining suc- 
cess in school, other than intelligence. 

Problem Two: To design a questionaire which re- 
flects subjective student opinionas to factors deter- 
mining success in school. 

Problem Three: To determine factors affecting 
Success of high school sophomores in biology. 


Definition of Terms 


Success: Lack of success in school could not be 
defined as ‘‘drop-outs” in this investigation since 
all students finally partic ipating were stillenrolled 
When the data was collected (although 14 out of the 
119 students who had originally been part of the study 
had left school), Instead, success or lack of success 
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is defined in terms of student concepts of what suc- 
cess in school means. To fail or almost fail a sub- 
ject or grade or to dislike it indicated lack of success 
to the students in this investigation. 


Reasons: Explanations of factors controlling or 
affecting success inschoolas stated by students and 
interpreted and analyzed by the investigator. 

Personal reasons refer to explanations of factors 
such as family or emotional problems, and aptitudes 
affecting success in school. “ 

Social and economic reasons refer tofactors such 
as family, social, educational, and economic back- 
ground (ex. Parents’ occupations, parents’ educa- 
tion, vocationalgoals, employmentafter school) af- 
fecting success in school. 

Educational reasons refers to past school experi- 
ences (ex. subjects liked bestand least, grades liked 
best and least, reasons for these preferences, rea- 
sons for failure). 


Intelligence: 1. 0. scores as measured by the Otis 
Self-Administering Test of Mental Ability, Higher 
Examination: Form А 


Students: 105 sophomores enrolled infive Biology 
1 classes taught by the investigator at Wm. L. Dick- 
inson High School, Jersey City, New Jersey during 
the semester, February to June, 1959. 


Delimitations + 
телпшапопз 


This investigation does not propose that the stu- 
dent sample involved in the study is representative 
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of the student population in the United States. It 
represents, instead, a low medium income group 
living in an industrialized, low renthousing area in 
a large metropolis in New Jersey. 

Itis recognized by the investigator that students’ 
opinions as to factors affecting success in school 
arenecessarily biased and perhaps lack perspective, 
but they can be very usefulas a projective technique 
to discover underlying causes of failure. 


Procedure and Results 


InMarch, 1959, students infive Biology 1 classes 
taught by the investigator at Wm.L. Dickinson H igh 
School, Jersey City, New Jersey were asked if they 
would participate in a study of factors determining 
success or lackof success in school in general, and 
in biology inparticular. Forty-five minutes of spon- 
taneous discussion in each group suggested many 
social, economic, personal, and intellectual impli- 
cations of the problem. Having just finished a unit 
on the Scientific Method in which methods of educa- 
tional research as well as scientific research had 
been examined, rapport had beenestablished, and 
the students were eager to participate. The discus- 
sion period proved to be a catalyst to the students, 
stimulating the free expression of their ideas ina 
written composition the next day. 

From the opinions and revelations included іп the 
compositions, and from notes taken duringthe oral 
discussion, a questionnaire was designed by the in- 
vestigatorlÜ which included all items expressed. 

The questionnaire was administered at the end of 
the semester after the marks of the biology course 


REASONS FOR DROPPING OUT OF SCHOOL 


Mar- Military Adverse School All 


Reached To goto 


Boy Drop-outs Total age of 16 work 


As given in 

school records 100% 19% 23% 
As given by 

the drop-outs 4 

themselves 100% 6% 25% 


Girl Drop-outs 


As given in 

School records 100% 16% 13% 
As given by 

the drop-outs 

themselves 100% % 12% 


тізре Service experiences other 
5% 24% 1% 

6% 38% 25% 

22% 21% 28% 
24% 31% 26% 


uu ee 
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TABLE I 


SOCIO-ECONOMIC BACKGROUND OF BIOLOGY STUDENTS IN STUDY 


Father Occupation N=89 Percentage 
SSS sv uou d. 22... Percentage 
1. Unskilled Laborer 36% 
2. Skilled Laborer і 35% 
3. White Collar 20% 
4. Professional 9% 
Does Your Mother Work? М-99 қ Percentage 
1. Yes 41% 
2. No 53% 
Mothers' Occupation N-47 Percentage 
i ее oc Percentage 
1. White Collar 55% 
2. Unskilled Laborer 38% 
3. Professional 4% 
4. Silled Laborer т 3% 
Who Minds the Children While Your Mother Works? N-47 Percentage 
Ою Ооз ee Works? NEAT —— percentage 
No one 10% 
Relative or Grandparent 30% 


" 
What is Your Vocational Choice? N-89 


Percentage 
1. Professional 44% 
2. White Collar 19% 
3. Skilled Labor 8% 
4. Athlete 4% 
5. Artist 3% 
6. Armed Forces 195 
7. Unskilled Labor қ 1% 
8. Undecided 4 20%, 


Language Other Than English Spoken in Home. N-105 


зен Ai Bercentage 
Yes А 


48% 
No 52% 


ШЕ сзсз кошы —— — — — - 
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TABLE II 


PERSONAL BACKGROUND OF BIOLOGY STUDENTS IN THE STUDY 


ee 


Place in Family Group N=105 % 
Only child 
Eldest ^ 
Youngest ч и 
Middle 32 


Attitude Toward Siblings N-94 


Other child favorite 33 
Resentment of sibling 37 
Affection for sibling 18 
Dominate 39 
Submissive 51 

Health N=104 
Excellent 33 
Good 47 
Fair 17 
Poor 3 

Reasons for Failure N=79 
Teacher * 39 
Uninterested in subject 35 
Subject too difficult 26 
Personal trouble 20 
Interest in one of opposite sex 19 
Nervousness during tests 15 
Laziness 8 
Work after school 8 
Rebellion against rules 6 
Not caring enough 30 

Parental Attitude N-105 

Toward School 

Encouraging 54 
Helpful : 50 
Cooperative | 34 
Lack of restriction 17 
Uninterested 1 
Interested T 8 
Lack of cooperation 8 
Too much restriction, 1 
Too much supervision 5 
No encouragement à 


Resentment toward school 


(ene ee лы о. 


_ ———— 
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Were closed. The students were assured that their 
responses would remain confidential and that their 
answers would in no way affect their grade for data 
would not be tabulated until а later date. Since the 
Students had been very frank and uninhibited in the 
expression of their ideas inthe original composition, 
this was probably an unnecessary precaution, but 
it limited, as far as possible, biased answers. 

During the discussion period, one of the q u e stions 
raised was whether intelligence really was an im- 
portant factor determining success in school. Many 
instances were pointed out of students who were 
bright and didn't succeed because of lack of drive, 
personal problems, or economic problems and of 
duller students who achieved because they were in- 
dustrious and were motivated to succeed. 

Studies have indicated that there is a very signi- 
ficant relationship between intelligence and continu- 
ance of the individual in secondary school, Kefauver 
states, 


Studies of the intelligence of high school pu- 
pils have shown beyond question that they are, 
in this respectalso, aselected group as com- 
pared pe the total population of high school 
age. 


He further reports a gradual rise of the median 
1.0. 's from the ninth grade 99.tenth grade 101, 
eleventh grade 103, and twelfth grade 105. 12 Samler 
found the mean 1. Q. of high school graduates to be 
105. 6 and that of the drop-outs to be 96. 3. 13 Wolf- 
bein 14 found that the proportion with I. Q. scores 
less than 85 among drop-outs was triple that among 
graduates and the proportion with Scores of 110 and 
Over among graduates was almost triple that among 
drop-outs. 85% of the drop-outs were behind their 
age group by at least one year and 53% by two or 
more years of schoolwork. Helists the percentages 
of the 22, 000 drop-outs as 10% under sixteen years, 
34% atsixteenyears, 27% atseventeen years 17% at 
eighteen years, and 12% at 19 years old. 

The mean age for the biology students studied in 
this investigation was 15, 8 years. 

The students in all three biology classes were 
given the Otis Self-Administering Test of Mental 
Ability, Higher Examination: Form A. during one 
class period near the end of the semester.I. Q. 

Scores were derived for eachofthe ninty-five stu- 
dents presentthat day. 15 The mean of the group was 
found to equal 101.8 + -1.3 and the standard devia - 
tion equalled 11. 7 + -. 84, 

Lack of success inschool could not be defined as 
drop-outs in this investigation since all Students fi- 
nally participating were still enrolled when the data 
was collected (although 14 out of the 119 Students who 
had originally been part of the study had left School), 
Instead, finalgrades in the biology course were used 
aS one criterian of success. The mean of the final 
;rades in Biology I was foundto be 76.8 +-, 99 with 
. Standard deviation of 9.3 + -.67. 

An attempt was made to determine if Significant 


relationship existed between ability in biology as de- 
termined by a final grade andI.Q. scores. 

Wood's Correlation Coefficient Form 16 was used 
to determine Pearson's r for correlation between 
Ptology final grades and I.Q. scores for 95 stu- 
dents. 17 Anr of .48 was derived. According to 
Garrett, 18 anr of . 40 and above denotes substantial 
or marked relationship between two tests, and so it 
must be assumed by this investigator that the I. Q. 
Scores hada marked or substantial relations hip to 

» success (as determined by the final grades)in the bi- 
-ology course. 

An analysis of the results of the questionnaire 
might indicate some of the factors other than intel- 
ligence which affects the success in school of the 
students in this investigation. 


Socio-Economic Background 


The largest part of the school population is de- 
rived from the industrial areas of the city. This is 
reflected by the large proportion of the fathers who 
are employed as laborers; 35% of which couldbe 
classified as skilled and 36% as unskilled, accordi ng 
to Rath’s classification;19 thus accounting for. 71% 
of the fathers inthe investigation. Forty-seven per- 
cent of the mothers supplemented the family income 
by part time or full time employment. Only 9% of 
the fathers and 4% of the mothers were inoccupations 
requiring professional training. Seventy per cent of 
these boys and girls who had both parents working 
had по Опе at home to supervise them or their broth- 
ers or sisters after school. Thirty-seven% of the 
fathers and 32% of the mothers of these biology stu- 
dents had dropped out of school at the eight grade 
or before, and 48% of the fathers and 58% of the 
mothers had had some Secondary school education. 
Only 10% of the mothers and 15% of the fathers had 
attended college and few of these had graduated with 
degrees. Although 44% of the students claimed to 
desire professional careers, it is unlikely that this 
many were really serious about their vocational 
Choice. Many doubted that they would be able to af- 
ford college and stated alternate choices in white 
Collar occupations. Research indicates that voca- 


tional goals are established by thefamily group. 
Rogoff states, 


One of the most recurrent findings in pre- 
vious research on occupational mobility is that 
Sons are more likely to enter their father's 
Occupation than to enter any other single oc- 
cupation, 20 


In his study of success and lack of success among 
geniuses, Terman found among students of equally 
high 1.Ө., twice as many that succeeded in high 
School and college whose parents hadgraduated from 
college, as those whose parents had a non-college 
background. According to Terman, 


Fathers of the A’s (successful geniuses) were 


: 
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TABLE Ш 


FACTORS AFFEC TING STUDENTS ATTITUDES TOWARD GRADES LIKED BEST AND LEAST 
ОСЕЛЬ НВА Саани аА) 


Grade Liked Best — N-105 % 
Eight 49 
Ninth 14 
Tenth 17 
All Others 21 
Why Students Liked This Grade Best 
Fun 51 
Teacher 46 
Work was Interesting 42 
Good Marks 28 
Work was Easy 13 
Other Students 12 
Grade Liked Least N=100 
Eighth 8 
Ninth . 82 
Tenth 16 
All Others 44 
Why Students Liked This Grade Least 
Poor Teacher 37 
Work Boring 32 
Unpleasant Atmosphere 23 
Poor Grades 21 
Hard Teacher 18 
Work Hard т 


Too Much Work 
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TABLE IV 
FACTORS AFFECTING STUDENT ATTITUDES TOWARD SUBJECTS LIKED BEST AND LEAST 
a SS 


Subject Liked Best N=105 % 
Science 2 
Social Studies 18 
Mathematics 16 
English 10 
Languages 10 
Miscellaneous 


Why Students Like This Subject Best 


Interesting 70 s 
Teacher 4 32 
Aptitude 30 
Usefulness 21 
Easy 10 
High Mark 1 


Subject Liked Least N=103 


Mathematics 30 
English 29 
Social Studies 19 
Languages 9 
Science 7 
Miscellaneous 6 


Why Students Liked This Subject Least 


Boring 51 
Too Difficult 30 
Teacher 26 
No Aptitude 21 
Not Useful 10 
Too Much Work 5 
Poor Grades 1 
- È 
TABLE V # $ 


FACTORS INFLUENCING ATTITUDE OF STUDENTS TOWARD BIOLOGY COURSE 
———Є————Є—_. 
—————Є—Є——Є———————————————Є— 


Attitude Toward Biology Course N=93 % 
Liked Biology 74 
Tolerated Biology 18 
Disliked Biology 8 


Factors Influencing Attitude Toward Biology 


Subject Matter Interest 69 
Teacher 61 L 
Atmosphere in Class Р 32 
Usefulness of Subject 31 
Amount of Work ^ 25 
Gades Received 29 
Vocational Goals 24 
Aptitude for Biology 20 
Level of Difficulty 16 
Background of Science Information 16 
Personal Problems 10 
Laboratory 10 
New Subject Matter 2 
Drawings 


far more ofteninthe professionalclasses. The 
important point here is that the educational 
tradition was stronger in families of the A 
group. 


і Many оѓ the biology students inthis study reflected 
a lack of motivation in school due to parental atti- 
tudes. In their discussion and in their composition, 
parental opinion was a dominating influence. Many 
stated, ‘‘I try to get good marks because it pleases 
my mother во much,"' ог, ‘I guess I don't really 


try. My parents don't саге too muchabout grades | 


in school or whether or not I do homework. ’’sev- 
enteen per cent complained of lack of restriction in 
parentalattitude, 8% of lack of cooperation, and 11% 
of lack of interest. Fifty per cent described their 
parents as helpful, 54% as giving encouragement, 
and only 8% felt their parents tobe really interested. 
It would seem that a great deal of **educational tra- 
dition" is missing in the home of the youngsters in 
this study. 

Another factor that must be taken into consider- 
ation in analyzing the lack of **educational tradition" 
among these students is the large proportion of par- 
ents who speak a foreign language in the home. 48% 
Spoke a language other than English, and 52% spoke 
only English. Eight per cent felt that the jobs they 
held after- school to help supplement the family in- 
come was one of the maincauses of their failures in 
School. 


Personal Background 


Family or Emotional Problems: Only 10% were 
only childrenin the family, andof the other 94 stu- 
dents who had brothersand sisters, 33% felt another 
child inthe family tobe their parents’ favorite, 37% 
expressed anactive resentment against one or more 
of their siblings while only 18% felt affection for 
their siblings. When asked which child usually had 
to “give іп” in the family, 51% felt it was they who 
Were submissive, while 39% admitted they dominated 
the other children. ' 

When asked to describe their general health, 3% 
felt they were in poor health, 17% fair, 47% clas- 
Sified their health as good, and 33% as excellent. 
Twenty per cent had serious personal troubles that 
caused them to neglect their studies, 19% were emo- 
tionally involved with someone of the opposite sex 
to such a degree that they felt it was an explanation 
for their failures in their courses. Fifteenper- 
Cent said they became so nervous during tests that 
they couldn't pass. Thirty per centblamed not caring 
enough for their failure, and6% admittedthey were 
rebellious againstthe rules and regulations of home 
and school. 

The emotional problems of adolescents have long 
been recognized by educators as serious blocks to 
efficient learning.Ina recent study done by ud 
and Moscovitch, in which the effect of emotiona 
Problems on performance in achievement tests was 


^ 
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determined, it was found that the emotional condition 
rather than the environmental factor or even the gen- 
eral intelligence of the child is the significant factor 
indetermining the performance of a childonan 
achievement test. 

Cook administered standardized tests to with- 
drawals and non-withdrawals and listed (as shown 
on the next page) the following as test scores геу eal- 
ing most significant differences between withdrawals 
and non-withdrawals. 

According to Raths’24, it is not the job of the 
teacher to meet the emotional needs of the children 
in class; this is primarily the responsibility of the 
parents, The teacher’s job is to promote learning. 
But if the frustrated needs of students seem to be 
blocking the learning process, the good teacher will 
want to do something about it. 

Within the framework of most classroom situa- 
tions, many of the emotional needs of youngsters 
canbe met once the teacher had developed an aware- 
ness of how these needs manifest themselves in stu- 
dent behavior, Raths lists practical suggestions of 
ways that teachers can meet the following needs: for 
belonging, for achievement, for economic security, 
for freedom from fear, for love and affection, for 
freedom from guilt, for self-respect, and for self- 
understanding. In his pamphlet, “Ап Application to 
Education of the Needs Тһеогу”, he describes typical 
behavior of young people reflecting unmet needs. 

By their answers on the questionnaire, many of 
the boys and girls in the investigation revealed emo- 
tional difficulties that might be factors affecting 
their success in school. Skillful guidance in helping 
them understand ego formation, in the developing of 
Self-acceptance, and direction in the achievement 
of mental objectivity might offer possible solutions 
to many of their difficulties. 


Educational Background 


Although 76% of the biology students had failed 
one or more subjects at one time or another in their 
school career, 74% claimed they liked school most 
of thetime. Interest was by far the most important 
factor in determining whether the youngsters liked 
or disliked a subject or grade; the difficulty of the 
work, marks received, usefulness of the information, 
andpersonalaptitude seemed to be subordinate fac- , 
torsin the opinions of these students. The fun they 
had in school and the kindofteachers they had were 
the two other factors mentioned most often by the 
students in this investigation. 

Inexplaining why they liked their favorite sub- 
ject best, 70% mentioned that it was interesting, only 
10% liked it because it was easy, 1% because a high 
mark was received, and 21% because it was useful. 
Fifty-One per cent liked their favorite grade best 

use they had had fun. 
en қыш the transitionfrom the eighth grade 
jnelementary school to the ninth grade іп high school 


difficult. Forty-nine per cent said they liked the 
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eighth grade betterthanany other, and 32% selected 
the ninth grade as the one they liked least. 

Science rated highest among subjects liked best, 
with 29% of students selecting science and 19% cho- 
osing social studies as theirfavorite subject. Math - 
ematics and English seem tobe the subjects disliked 
most, with 30% expressing their dislike for math, 
and a close 29% for English. 

Difficulty of a subject in itself does not seem to 
be an important factor affecting failure according to 
the opinions of these students, Eighty per cent claim- 
ed they did not give up when a subject was difficult, 
and 50% said they were actually motivated by diffi- 
culty. 4 


Factors Affecting Success of High 
School Sophomores in Biology 


Although there was a15. 1% failure and 27, 4% near 
failure (grades between 70-72 and failing one mark- 
ing period) for the term in Biology among the stu- 
dents in this investigation, 74% stated that they liked 
the Biology Course. Again interest in subject matter 
Seemed to be the most importantfactor, with 69% 
Checking off interest as an influence on attitude to- 
ward the course. 


Sumary and Conclusions 


1. The 1.0. scores ofthe students in the inv es- 
tigation hada marked relationship to success 
(as determined by tinal grades) in the Biology 
cour se. 

2. The socio-economic background of the stu- 
dents in the study lacked “educational tradi- 
tion" which probably limits, to some extent, 
motivation to Succeed in school. 

3. Ву their answers onthe questionnaire, many 
of the boys and girls in the investigation re- 
vealed emotional difficulties that might be 
factors affecting their success in school. 

4. Interest was, by far, the mostimportantfac- 
tor in determining whetherthe Students liked 
or disliked a subject ora grade; the difficulty 
ofthe work, marks received, usefulness of 
theinformation, andpersonal aptitude seemed 
to be subordinate fact ors in theopinionsof 

these students. The fun they had in School and 


4. 


о 1 


T Test Non-withdrawals Mean Withdrawals Mean Difference a 
Bell Adjustment Inventory 7.25 10. 25 3.00 
S. R. A. Youth Inventory 
My school 6.32 9.09 2.77 
My Home Family 3.44 8.27 4.83 
Health 2.92 4. 68 1.78(28) 


the kindof teachers they had were the two other 
factors mentioned most often by the students. 
Students’ favorite grade was the eighth, and 
their favorite subject wasscience. The ninth 
grade was the one liked least, and Mathe ma- 
tics andEnglish were subjects disliked most. 
Although there was a 15.1% failure anda 
27. 4% nearfailure (grades between 70-72 and 
failing one marking period) for the term in 
Biology among the students in this investiga- 
tion, 74% stated that they liked the Biology 
course. Again subject matter interest and 
teacher influence ranked high in their opinions 
as to reasons for attitude toward subject. 
Factors affecting lack of success in school 
today are very much the same as they were 
twenty years ago, in spite of economic pros- 
perity, modifications in school curriculum, 
and increased schoolpopulation. Further ed- 
ucational research is indicated with greater 
emphasis on why these are factors, rather 
than what the factors are. 
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An Analysis of School Subject Preferences of Elementary 
School Children of the Middle Grades 


THIS STUDY represents an investigation of the 
relative popularity of certain school subjects among 
children of the third, fourth and fifth grades; the 
relationship that exists between the choices of chil- 
dren and variables of sex, achievement in the sub- 
ject chosen, andlevel of intelligence; and the degree 
of similarity between the subject preferences of chil - 
dren and their teachers. 


Review of the Literature 


Examining subject preferences of fourth through 
sixth grade pupils, Mosher (1) found little significant 
difference between the first choices of boys and girls. 
Arithmetic, art, social studies, and spelling were 
among the top-ranking selections. 

Dean, (2) investigating the relationship between 
Subject preference and achievement of fifth grade 
children, reports no statistically significant finding 
though boys indicated strong preferences in their 
Selection of arithmetic, spelling, and reading. 

Inafollow-up study of Boston fifth grade children, 
Chase and Wilson (3) found reading, art, Spelling, 
and social studies to rank high in terms of pupil pre- 
ference. They report that boys show a significantly 
stronger preference for social studies than do girls 
but report no other significant preference on the 
basis of sex. Comparing teacher preferences with 
student choices a strong relationship was found to 
exist in all subjects except art and spelling. 


Procedure 


Approximately three hundred children of the 
middle grades were asked to list, in order of pre- 
ference, their favorite school subject. First choices 
were allocated three points; second, two points; and 
third choices, one point. Inthismanner a total point 
accumulation was derived for each of the subjects. 

Data relative to level of intelligence and achieve- 
ment in the areas of reading and arithmetic were 
secured from the California Testof Mental Maturity 
and the California Achievement Test, each admin- 
istered in September, 1959. 


E.L. GREENBLATT 
Los Alamitos School District, California 


Subject Preferences 


On the basis of total point accumulation (Table II) 
art, arithmetic and reading enjoy a substantial de- 
greeof preference. Todetermine if the subject hier- 
archy obtained was statistically significant a series 
of critical ratios was prepared. (Table III). 

The numerical advantage held by art and arith- 
metic over all subjects, withthe exception of social 
studies, is significant atthe . 01 level of confidence. 
Reading shares the position of art and arithmetic, 
exceeding others (except social studies and spelling) 
in terms of pupil preference. 

It is interesting tonote, that although art, arith- 
metic, and reading were not found to be significantly 
preferred over social studies, social studies was not 
found to be more desireable than any of the subjects. 

Science and music appear to occupy an interme- 
diate position whereas writing, language, and health 
are, beyond question, the least preferred in this 
study. 


Sex and Subject Preferences 


Certain conclusions are evident when examining 
subject preferences onthe basis of sex. The finding 
(Table IV) that boys prefer science and girls music, 
is significant at the 01 level of confidence. The pre- 
ference for social studies indicated by boys is sig- 
nificant at the .05 level of confidence. 

To determine the degree of correspondence be- 
tween the selections of boys and girls, rank order 
correlations were calculated. (Table V). Though a 
significant relationship is apparent, it becomes in- 


ШЕР Strong at the fifth grade level. (Rho equals 


Achievement and Subject Preferences 


Chi square tests of the Significance between 
achievement in reading and arithmetic and choice of 
these subjects were computed. (Table VI). Pupil 
performance on the California Achievement Test in 
the subjects servedas an indication of a child's de- 
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viation from grade level expectancy. 

Insofar as reading is concerned no relationship 
was discovered between achievement of the pupils 
and the degree to which they preferred it. Girls, 
however, appear to choose or not choose arithmetic 
in terms of their achievement in the subject. This 
finding was significantatthe . 02 level of confidence. 


Intelligence Level and Subject Preferences 


To examine the relationship between choices of 


children and level of intelligence, two groups were 
formed. Group I consisted of children with an IQ of 
110 orhigher, Groupllof children with an IQ of 91 or 
lower. Table VII reports the results of rank order 
correlations between the choices of boys and girls in 
each group and of the total sample of each group. 

Whereas a wide divergence of opinion exists be- 
tween the choices of boysandgirls in Group II, there 
is a marked similarity in subject preferences be- 
tween the sexes in Group I. The agreement in this 
case is much stronger than that of boysand girls of 
grades three and four (Table V). 

In the matter of subject preference Group I chil- 
dren rank arithmetic first; art, second; and reading, 
third. Group П pupils list in order of preference 
art, reading, and spelling; music, arithmetic, and 
science occupying third position. 

Health and writing were omitted by Group I chil- 
dren. Group II failed to select health, writing, social 
studies, and language. 


Teacher and Pupil Preferences 


Instructors were asked to indicate the school sub- 
ject they most enjoyed teaching. As in the case of 
the children, points were assigned their selections 
on the basis of first, second, or third choice (Table 
ҮШ). Grade level distribution of teachers was as 
follows: thirdgrade, three; fourth grade, three; and 
fifth grade, four. Nine of the instructors were female. 

Rank order correlations (Table IX) indicate strong 
agreement (rho equals .90) between preferences of 
teachers and pupils of Group I. This finding is sig- 
nificant at the .01level of confidence. Though a re- 
lationship appears to exist between choices of teach- 
ers and other segments of the sample, the strength 
of the correlations are not sufficient to warrant state- 


ments of a predictive nature. 
Summary 


1. Art, arithmetic, andreading were found to be 
significantly preferred over other subjects with 
the possible exceptionof spelling and social stud- 
165. 

2. The position of social studies is somewhat 
nebulous, in that it was not significantly more 
preferredthan the least desireable of the subjects, 
nor were subjects of high preference (art, arith- 
metic, and reading) foundtobe significantly high- 
er in regard than social studies. 

3. Boys show a significant preference for sci- 
ence; girls for music. 

4. A significant relationship exists between 
achievement of girls in arithmetic and their sel- 
ection of the subject. 

5. A strong correspondence was found between 
subject preferences of boys and girls with an IQ 
of 110 or higher, andlittleor no relationship be- 
tween choices of boys and girls with an IQ 0191 
or lower. 

6. Children of the higher intelligence group show 
a wider diversity in subject selection than do pu- 
pils of Group II. 

7. The preferences of teachers correspond most 
strongly with those of the children in the higher 
intelligence group. 


Analysis of subject preferences of children isa 
gross measurement at best. However, it may pro- 
vide important clues pertaining to the identification 
of derived satisfactionsof students from school cur- 
ricula. 
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TABLE І 


ANALYSIS OF STUDENT POPULATION 


Third 38 44 82 
Fourth 40 41 81 
Fifth 59 60 119 
= — s o eu MSS bet eins i 
Total 137 145 282 


TABLE II 


POINT ACCUMULATIONS FOR SELEC TED SUBJEC TS 


ee 


Subject Points Assigned 


EI cu eS ea 


Art 


341 
Arithmetic 309 
Даш 274 
Spelling 207 
Music 187 
Science 154 
Social Studies 98 
Writing 45 
Tanbuses 41 
Health 26 
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TABLE IV 


SUBJECT PREFERENCES ACCORDING TO SEX 


Subject Sex M SD SE SEd CR 
—— 02.2 GMT А. 
Art Boys 1.28 1.21. . 103 
Girls Tti 1.19 . 099 .142 1.27 
Arithmetic Boys 1.05 1.26 .108 
Girls 1.14 1.26 .105 .150 60 
Reading Boys .88 1.14 .970 
Girls 1.07 1.21 .100 . 980 19 
Spelling Boys . 68 1.00 . 086 
Girls .79 1.05 . 087 122 89 
Music Boys .49 .88 1.070 | 
Girls .86 1.08 . 090 ‚114 3. 302 | 
Science Boys .85 1.09 . 093 Г 
Girls .26 415 . 062 11 5.262 
Social Studies Boys .46 .96 . 082 
Girls .25 .69 ‚051 . 099 2. 13b 
Writing Boys .15 .50 .429 
Girls E .58 . 048 .432 .06 
Language Boys .10 .39 . 033 
Girls .18 .54 . 449 .450 19 
Health Boys .09 .36 . 030 
Girls .09 .40 . 034 .045 .11 
а Significant at the .0llevel of confidence. ———— — ——— — —— 


b Significant at the .05 level of confidence 


TABLE V 


RANK ORDER CORRELATIONS BETWEEN SUBJECT 
PREFERENCES OF BOYS AND GIRLS 


Boys 
Grade 3 Grade 4 Grade 5 Total 
"NICHE E i РА, ЖАЛ РЕНИЙ 
Grade 3 2699 
Grade 4 649 
Girls 
Grade 5 .91* 
Total 5 


.90 


а Significant at the .01 level of confidence 


b Significant at the . 05 level of confidence 
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TABLE VI 


CHI SQUARE TESTS OF SIGNIFICANCE BETWEEN 
ACHIEVEMENT AND SUBJECT SELEC TION 


Subject Boys Girls Total 
NR SO EUER вана АА „сыгы чаш 
Reading 1.128 1.492 2.039 
Arithmetic 2.110 8.141* 3.155 


a Significant at the .02 level of confidence. 


TABLE VII 


RANK ORDER CORRELATIONS BETWEEN SUBJECT PREFERENCES OF 
BOYS AND GIRLS IN EACH INTELLIGENCE GROUP 


амалын E ML скат ORIS 


Items correlated 
.80* 


Choices of Group I girls with Group Iboys 


Choices of Group П girls with Group II boys 431 

Choices of Group I boys with Group II boys .21 

Choices of Group I girls with Group П girls .40 
2109 


Choices of total Group I with total Group П 


the . 01 percent level of confidence. 


a Significant at 
the . 05 percent level of confidence. 


b Significant at 
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TABLE VII 


POINT ACCUMULATIONS FOR SUBJECTS 
CHOSEN BY THE TEACHERS 


Subject Points Assigned 

Social Studies 21 

Reading 12 

Arithmetic 9 

Music 7 

Science 4 % 
Art 3 

Spelling 2 

Language 2 

Writing 0 | 
Health 0 

o шыш. игш д ан ауы IT S e RET o s 


TABLE IX 


RANK ORDER CORRELATIONS BETWEEN SUBJEC T PREFERENCES ' 
OF TEACHERS AND PUPIL GROUPS 


Есш 


Total “Group I Group II АП АП 
sample (High IQ) (Low IQ) girls boys 
Rho .55 ‚15® .56 ‚625 ‚61° 


a Significant at the . 01 level of confidence. 


b Significant at the . 05 level of confidence. 
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Students’ Perceptions of Factors Affecting 


Test Performance 


ACONSIDERATION of the methods and techniques 
employed by students’ in their approachand prepar- 
ation for a test situation has not, as yet, brought 
into being a body of systematic research. In spite 
of the perfectionof testing tools, test situations are 
frequently perceived by both students and teachers 
as forms of punishment -- mild or otherwise, de- 
pending on the difficulty of the testing instrument-- 
rather than a learning experience. The student may 
approach an examination, and perceive the experi- 
ence as a mental challenge (Bloom and Broder, 
1950), setting up, initially, a problem-solving set, 
where the student has prepared himself in advance. 
In a second approach, the student may be hesitant 
because of lack of preparation, lack of confidence in 
applying what he knows, and/or past experience. 
Doubt as to the meaning of terms, misinte rpreting 
questions or directions, and the “1 never-do-well - 
оп- multiple-choice-questions'* attitude are preva- 
lent here. Their subsequent effects on test perfor- 
mance have been discussed by Bloom and Broder 
(1950, Earle (1950), and Gaier (1952a, 1952b, in 
press). 

The present investigation was initiated to examine 
the factors believed responsible for test performance 
as perceived by students aboutto take a written ex- 
amination. Specifically, this study is a first attempt 
to examine the approaches employed by а group of 
college students in writing their first class examin- 
ation. Attention was focused on the analysis of the 
reasons given by students, successful and unsuc- 
cessful, inaneffortto determine whether those who 
expected to receivea good test grade possess a test 
orientation different from those less hopeful of even- 
tual test success. 


Subjects and Procedure 


Prior to the distribution of test materials for the 
first quiz of the semester, 276 undergraduates en- 
rolled in sevenclasses of the Educational Psychology 
Sequence at Louisiana State University were request- 
ed to complete a form containing the following di- 
rections: 


Assume that you will receive aletter 
grade of (A)/(D) on the test you are 
to take. List the specific activities, 


EUGENE L. GAIER 
The University of Buffalo 


either on your part or on the part of 
the instructor, that you feel were in- 
fluential or responsible in making this 
grade. 


To avoid contamination from reactions to the test 
itself, the completed forms were collected before 
the examination was handed out. The present results 
are based on a group of 90 men and 186 women. Of 
this total group of 276, 136 students (90 women and 
46 men) were instructed to assume a letter grade of 
«А? in the test, while 140 subjects (96 women and 
44 men) were to presuppose a letter grade of “р”. 


Measures of Resourcefulness in Test Performance 


A system of classification of the reasons listed 
for earning the designated grade, categorized ac- 
cording to the source, Was developed. Each sepa- 
rate idea, in many instances a clause or phrase, 
served as the unit for classification. This unit of 
division was selected since it appeared that a class- 
ification based on the number of words written, or 
on all materials expressed at one point in the listing 
would possibly obscure of eclipse the thinking em- 
ployed by the student to support his actions for the 
grade received. This becomes clear when one ex- 
amines the following four protocols, two submitted 
by “А” subjects and two by “‘D’’ subjects: 


Subject 1: (Male, Age 19, Grade A) 

1.** Assuming that] receive the letter grade of 
* A! on this test, I would feel that the amount 
of preparation and studying that I haddone 
would be the greatest factor, By prepared- 
ness, I mean the amount of class lectures 
that I had attended, the putting to use of this 
knowledge on the test. М, 

2. ** The clearness and effective way the instruc- 
tor brought forth the test material would be 
of prime importance also. The instructor is 
precise in his explanations, and well posted 
in his subject matter. y. 

3. *Summarizing, I would say that the amount 
of effort, both studying and achieving lecture 
matter along with the effectiveness of the in- 
structor would have made the grade possible." 
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TABLE I 


FREQUENCY AND PERCENT BREAKDOWN OF REASONS CITED FOR 
SUCCESSFUL TEST PERFORMANCE 


Female Male Total 
Category N % N % N % 
I. Success Due to Self and 
Self- Activities 
1. Preparation andstudy 
methods 157 35.3 52 28.6 209 33.3 
2. Industry and motivation 57 12.8 31 17.0 88 14.0 
3. Class atendance and 
attention 21 4.7 4 2.2 25 4.0 
4. Physical and Psychological 
well being 18 4.0 5 2.8 23 3.7 
5. Comprehension and reason- 
ing ability 23 5.2 6 3.3 29 4.6 
6. Test understanding 18 4.0 1 5 19 3.0 
7. Academic and personal 
background _8 1.8 23 1.6 mit 1.8 
Sub- Total 302 67.8 102 56.0 404 64.4 
П. Success Due to Teacher 
1. Quality of Presentation 40 9.0 19 10.4 59 9.4 
2. Instructional Techniques 40 9.0 28 15.4 68 10.8 
3. Understanding classroom 
atmosphere 25- 5.6 п 6.0 36 5.7 
Sub- Total 105 23.6 58 31.8 163 25.9 
Ш, Success Due to External 
Factors 
l. Luck 8 1.8 3 1.1 
: 11 1.8 
2. Test Characteristics 17 3.8 8 4.4 25 4.0 
3. Previous courses 8 1.8 7 3.8 15 2.4 
Sub- Total .33 7.4 18 9.9 51 8.2 
IV.Denial of the Possibility 
of Receiving an A 
1. Due to Self 1 2 
: - = 1 .2 
2. Due to Teacher 3 .8 3 1.7 6 1.0 
3. Due to External Factors Ki 42 b .6 2 .3 
Sub- Total bs 1.2 4 2.3 9 1.5 
TOTAL 445 100.0 182 100.0 627 100.0 
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TABLE II 


FREQUENCY AND PERCENT BREAKDOWN OF REASONS CITED FOR 
UNSUCCESSFUL TEST PERFORMANCE 


Category 


Failure due to Self 


Ty 


2. 


Lack of preparation and 
poor study habits 

Lack of industry and 
maturation 

Class attendance and 
attention 

Tense, not mentally and/or 
physically well 

Not able to understand 
and to reason 

Not understanding tests 

Background experience 


Sub- Total 


Failure due to Teacher 


1. 
2. 
3 


өзі 


Quality of lecture 

Instructional techniques 

Interest of instructor and 
failure in obtaining 
rapport 

Personal characteristics 

Teacher's tests 

Unspecified 


Sub- Total 


Failure due to internal factors 


өле 


Interfering conditions 

Test characteristics 

Previous couses in 
psychology insufficient 

Characteristics of material 
studied 

Outside activities 


Sub- Total 


Not Classifiable 


TOTAL 


N 


Female 


wv 
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A со co 
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N 
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Subject 2: (Male, Age 20, Grade A): 

1. “Reading and outlining the test following 
the lecture material which complemented 
the test. "' 

2. “Тһів type of course pertains directly to 
тпез?? 

3. “Ргеуіочв course in psychology and а de- 
Sire to know more about child development 
are theactivities which have had the above 
effect. ” 

4. ** The instructor presented the subject mat- 
ter in such a way that it was enjoyable and 
practical. However, the side of the bed 
from whichthe professor arose this morn- 
ing may have had its effect. ” 

Subject 1:(Female, Age 20, Grade D): 

1. “І haven't been feeling well for the last 
week andhave spent time in the infirmary, 
and I didn’t have my books with ше.” 

2. “Іпеуег know howto study for a psychology 
test. They always make up multiple-choice 
questions where all the choices 1 ook cor- 
rect." 

3. “Тһеге has just been too much to do this 
week with Homecoming. ’’ 

4. “I never get good notes in this class. We 
coverthe materials, butI never am able to 
organize it for studying at home. ’’ 

5. “Тһе type of test may not be the kind I am 
very familiar with and does not seem to 
cover any of the subject matter thatI 
learned.” 

Subject 2: (Male, Age 22, Grade D): 

1. “We didn’t have enough discussion on the 
materials coveredin the test. The instruc- 
tor never follows the book, and I get very 
confused. ’’ 

2. “I really don’t like the course; I never like 

| any 8 o'clock class and I never can stay a- 
wake." 

3. ‘I guess that I should be taking something 
else where you can reach some definite an- 
Swer. In psychology, there is never any 
real answer--at leastnotin class. And on 
the tests, we have to give answers. ” 


It was considered helpful in ordering the data to 
make a general classification of the discriminations 
employed. These discriminations might be consid- 
ered, a priori, to be germane on the part of the 
Student to the actual test Situation, those generated 
by theinstructor, orthose factorsexternal or irrel- 
evant to the examination. 

A. TestSuccess The 627 reasons listed to sup- 
port a letter grade of “А” were grouped intofour 
major areas: 1) Success due to self (e. g. study 

methods, industry, motivation, class attendence , 
background); 2) Success due to the teacher (е. g., 
quality of lectures, instructional techniques, class 
rapport); 3) Success due to external characteristics 
(e.g., test characteristics, luck); and 4)Denial of 


the possibility of getting an “А”. Within each of 
these four major categories, additional sub-cate- 
gories were included as shown in Table I. 

B. Test Failure A parallel classification for the 
580 responses obtained from the ““Р”” subjects was 
developed. This included: 1) Failure due to self (е. 
g., lack of interest, anxious, lack of self-confid- 
ence, never goover a test when I have finished with 
it); 2) Failure due to the teachers, (e.g. , discussion 
not thorough enough, no class review, poor lecture; 
3) Failure due to external factors (e.g., too many 
exams on the same day, lackof time, questions too 
ambiguous and tricky, testtoolong). The sub-cate- 
gories included within the major heading are shown 
in Table 2. Of the total responces in this category, 
12 reasons were rejected as unclassifiable or in- 
complete. 


Results and Discussion 


The frequency and percent breakdown by cate- 
gories of the reasons cited in support of either a test 
grade of А or D are presented in Tables I and II, 
respectively. 

Success or Failure Due to Self -- Of the 404 re- 
sponses presented by the students, 67.8 of the 
reasons attributedtest success to personal charac- 
teristics and activities. The subjects felt that effec- 
tive usage of study time, comprehension and under- 
standing of the materials, the ability to apply the 
materials studied, as well as the ability to reason 
relevantly and judiciously would guarantee effective 
test performance. In a similar vein, 54.7 percent 
of the 580 reasonsgiven in support of a failing grade 
Were concerned with self-activities.Inthis instance, 
reversals of the reasons set forth by the successful 
students were encountered: poor usage of study time, 
lack of comprehension and unavailable kno w ledge,. 
the inability to apply the materials learned, as well 
as the lack of skill in effectively using the materials 
studied. 

In attributing test performance to either personal 
adeuacy of personal ineffectiveness. It becomes ap- 
parent that test success is perceived and experienced 
as a kind of triumph stemming from within, while 
failure is viewed in an intropunitive manner with 
self-blame or censure for poor performance. A test 
of chi-square applied between the two sex groups for 
the seven Sub-categories of both test failure did not 
yield any significant differences. 

Success or Failure Due to the Instructor -- With 
two-thirds of the reasons cited for test success, and 
slightly more than half of the reasons given for a 
poor test grade centering around self-activities, one 
would initially predict that the good students might 
minimize the effectiveness of the teacher in test suc- 
cess while the unsuccessful student would tend to 
project their failure to an outside source, specifi- 
cally the instructor, This, however, was not the 
case. As may be seen in both Tables I and II, one 
quarter of all the factors for good test performance 


(25.9 percent, women 23.6 percent and men 31.8 
percent) were attributed to theinstructor. Here, the 
students praised thequality of presentation, the in- 
structor's knowledge, his specific instructional tech- 
niques, the class discussion, personal interest, 
class rapport--among a number of others. 

When failure was attributed to the instructor, his 
activities or lack of them as found in 20.1 percent 
of the total reasonslisted women, 21.1 percent, men 
19.8 percent, the subjects cited reversals of the 
praiseworthy characteristics noted by the “А” stu- 
dents, as well as commentsonthenature of the tests 
given by their particular instructor. This appears 
overly defensive, inthat none of the subjects had 
previously had work with his instructor. One might 
speculate either on the effectiveness of campus ru- 
тог, departmental reputation, or, possibly, the 
need to establish some system of defense should the 
person do poorly in the examination. 

Success or Failure Due to External Factors: - - The 
most relevant activities for succes test perform- 
ance would appear to be problem-oriented thinking 
and activities directly related to the course topics, 
with emphasis onthe ideas directly under consider- 
ation on the test. Not related tothe test directly, but 
still confined to the class situation, would be the 
student’s thinking about people in the classroom-- 
especially the instructor. Butin the attempt to just- 
ify test performance, the student may introd u ce be- 
haviors external to the class process to support his 
own activities. In fact, one might predict that with 
greater test effectiveness one would expect few ex- 
ternal reasons listed. It also would follow, that low 
test success, would be accompanied by high external 
interference (cf. Gaier 1952a, 1952b). 

In attributing test success or test failure to causes 
beyond either self-activities or theinstructor, major 
group differences emerged. Of the total responses, 
only 8.2 percent of those given by the “А” students 
believed that their performance was a function of 
luck, test characteristics, Or previous work. The 
men (9.9 percent) seemed slightly more swayed by 
externalfactors but not significantly so (women 7.4 
percent). 

Mediocre test scores were assigned to external 
factors in approximately a quarterofthereasons ob- 
tained from ће “Б” groups (Total = 23.2, Men 23. 8 
percent, Women, 23.7 percent). Here, the subjects 
reacted to distractions and diversions in the test 
situation (eventhough the test was forthcoming) such 
as the testing situation, the heat, physical makeup 
of the test, as well as objects or situations and ac- 
tivities outside the test or course situation. Опе 
also finds evidence of reactions to other people and 
activities in the situation. These findings are in di- 
rect line with the author’s investigation (1952a) of 
the relation of personality characteristics and learn- 
ing variables in that: £ 

“We might expect that the higher the AE 
iety readiness, the more the individu 

will attend to objects in his environment, 
rather than to the subject matter itself... 


GAIER 
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Since the objects in the class are quite 
Structured, definite, and unchallenging, it 

is possible that they become anchorage 
points lending themselves to remove the in- 
dividual from the class material.’’ pp. 14. 

Thus, asthe student expects to encounter a treat- 
ening situation, in this case the possibility of a low 
test grade, he may attempt either to withdraw or 
endeavor to introduce stability by clinging to concrete 
details and familiar objects. These affective ele- 
ments may become so fused into his perception that 
he may become part of the external world which 
changes as these elements shift in their dynamic re- 
lation tohim, thereby experiencing little discomfort 
withpoor performance. The withdrawal into concrete, 
but nevertheless interfering perceptions, suchas in- 
dicated by the present “Р”? subjects, would serve to 
protect him by masking the most pertinent reason for 
poor performance: lack of relevant activity on his 
part. ” 

Denial of the possibility of an “А”:--Міпе stu- 
dents considered the possibility of successful test 
performance so out of line with their self-conc e pts 
and previous work that they rejected the entire as- 
sumption of making an “А”. Sixof the denials were 
attributed tothe instructor, perhaps, indicating that 
either class performance or personal interac Йоп 
would preclude this grade. More striking, perhaps, 
is that not one of the “D” students rejected the no- 
tion of receiving a “D”. 


Summary and Conclusions: 


The present investigation was concerned with an 
assessment of the activities and behaviors perceived 
by students as responsible or influential for either 
testsuccess or failure. А sample of 216 undergrad- 
uates enrolled in the Educational Psychology se- 
quence at Louisiana State University was asked to 
assume that they would receive agrade of “А”, or 
eD” onthe examination they were about to take. This 
examination was the first of the semester, andno 
student had had previous course work with the in- 
structor. Of this sample, 135 subjects were asked 
toassume thatthey would receive an “А” onthe test, 
while the remaining 140 were to receive а “D”, then 
to list the activities, either on their part or on part 
of the instructor, that they felt were influential or 
responsible in making the designated test score. The 
1207 listed responses were categorized into the areas 
of: a) Success or failure due to the self; b) Success 
or failure due to the instructor; and c) Successor 
failure due to external factors. The main findings 
based on an analysis of these categories, may be 
summarized as follows: 

1. Students who believed that they were to 
receive the grade of “А” associated test 
success with activities on their own be- 
half more often than students expecting а 
poor grade (67. 8 percent vs. 54. T percent). 
2. Test success or failure attributed to 
the instructor and his activities was more 
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frequent for the “А” group (“А” group = 
25.9 percent, ‘‘D’’ group = 20.1 percent): 
3. In attributing test success or failure to 
external factors, atleast, those beyond the 
self and the instructor, the ‘‘D’’ students 
introduced outside influences approximately 
three times more often than the students 
who expected to perform well on the exam- 
ination. 


As individual expectancies progressedfrom high 
to low test performance, evidence of increased pro- 
jection of personal inadequacies onto a non-threat- 
ening source seemed to occur. The student frequent- 
ly attempted to produce stability into his own per- 
ceptions by clinging to concrete details or familiar 
objects as did the present ‘‘D’’ group. The discrep- 
ancies between students' self-perception and the in- 
Structor's interpretation of test performance could 
resultinthe difference between mutual understanding 
and perceived class rejection leading to possible 
punishment -- in this case, a low grade. As the in- 
Structor attempts to develop more complex skills , 
а consideration not only of scholastic attainment but 
of affective accompaniment of student work becomes 
salient. For test performance cannot be viewed as 
w holly afunction of student's attitudes and habits of 
applying existing knowledge and skills in the test 
situation itself. Many testing procedures remain in- 
effective and represent time both wasted and anxiety 
provoking until the instructor learns how to adapt 
them, notonly to the varied intellectual abilities of 
the students, but also to their varied personality 
characteristics in the face of test threat. 


FOOTNOTES 
1. (1950) In her study of the problem-solving 


methods employed on comprehensive ex am- 
inations at the University of Chicago, Earle 


(1950), indicated that among the discrimina- 
tions used by her subjects as a function of test 
construction were: **(1) those having to do with 
language similarities or catch phrases; (2) de- 
cisions on the basis of truth or falsity where 
this was not intended as a criterion: (3) judg- 
ingthe significance of an author's ideas by how 
much space was devoted to it or by its ‘geog- 
raphical' position inthe article rather than by 
its logical implications; (4) the use of general 
logic, common sense, or general knowledge to 
answer questions supposed to test the compre- 
hension of particular reading selections in the 
course, (5) getting hints ontheanswers to the 
questions by noting the wording of other ques- 
tions rather than by prior knowledge of the 
Subject matter; (6) making a general principle 
of avoiding ‘‘none-of-the-above’’. 
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Ап Evaluation of a Joplin Plan of Grouping for Reading Instruction 


A CONDITION that createsa problem in reading 
instruction is the great variation in reading ability 
existing among children in the elementary school. 
Professional references in reading and collections 
of research reports cite this condition with consid- 
erable frequency. Classroom teachers also often 
indicate its prevalence as a problem in instruction. 

In the primary grades, the range is certainly e- 
nough to pose instructional problems, but in the in- 
termediate and upper grades, it is extensive enough 
to cause real difficulty inorganizing for instruction. 
For example, an unpublished study ofthe writer's 
revealed that while the range in reading ability of a 
group of second graders was four grade levels at the 
beginning of the school year, the range at the begin- 
ning of grade five was six grade levels. The differ- 
ences in reading ability in grade eight ranged from 
third grade level to eleventh grade level. 

Providing for individual differences in reading in 
primary grades quite frequently takes the form of 
grouping within the class for reading instruction. A 
study by Purcell (6) indicated that 98 percent of the 
150 teachers contacted in Ohio, Pennsylvania, and 
West Virginia followed that procedure. Inthe inter- 
mediate grades, various procedures have been used, 
including grouping within the classroom. 

There has been a recent revival of interest in 
cross-grade grouping as a method of meeting indi- 
vidual differences in reading. Such grouping is not 
a device of recent origin incaring for individual dif- 
ferences in reading. Harris has described its use 
before 1950 (3), and has presented the results at- 
tained. The revival of interest insuch grouping has 
been occasioned in part by the television publicity 
given the use of it in the schools of Joplin, Missouri. 
Cross-grade grouping has thus become kno wnas 
“Тһе Joplin Plan. ”’ 

In such a plan, reading becomes a departmental- 
ized subject in a school in which most, if not all, of 
the other subjects are taught by one teacher to one 
group of children. The basis for setting up а read- 
ing class is the grade level on which the с hildren 
read. For example, a fourth grader who reads on 
a third grade level will have reading W ith other 
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* Footnotes will be found at the end of the article. 
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children reading on the third grade level. The oth- 
er children with whom he has reading may be third 
graders, fourth graders, or in some grade above. 
Floyd has described the program as it operates 
in the Joplin schools (1) and has indicated that the 
results are favorable. Williams (7) has described 
the version of the plan that has operated inthe Chi- 
cago area with less spectacular results. This paper 
describes the Joplin Plan as it operated in grades 
four, five and six in two elementary schools in Lo- 
gansport, Indiana, during the school years 1958-60* 


Description of the Study 


In evaluating the Logansport version of the Jop- 
lin Plan, an attempt was made to find the answers 
to the following questions: 

1. What gains in reading achievement have been 
made by pupils between the time they e ntered the 
program and the end of their tenure in the program ? 

2. To what extent are the above gains commen- 
surate with the gains to be expected when the abili- 
ties of the students are taken into consideration? 

3. To what extent have the students involved in 
the program reached their expected reading level 
by the end of their tenure in the program ? 

4. To what extent has the program been benefi- 
cial to students of high and low learning ability ? 

5. How do the teachers whoare involved feel a- 
bout the program ? 

6. What are the feelings of the pupils concern- 
ing their own reading and the program ? 

Achievement test results were available for 144 
fifth and sixth grade pupils who had been enrolled 
inthe program for two Schoolyears. The cumula- 
tive records also contained test res ults for 75 
fourth graders who had been in the program for one 
school year. These results were usedto determine 
reading gains during the program. 

Each of the eight teachers who ha d participated 
in the program during its second year cooperated 
in the study by completing aquestionnaire concern- 
ing his teaching experience, training, methods of 
teaching reading, and his reac tions to the Joplin 
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Plan as it operated in his school. 

The 219 pupils who had participated in the pro- 
gram for either one or two of the complete school 
years of its operation were asked to complete a 
questionnaire concerning theirreading habits, feel- 
ings about their reading, and feelings concerning 
the way in which they were being taught to read. 
These questionnaires were completed under the di- 
rection of the writer. 


The Nature of the Program and its Subjects 


In the two schools involved, reading was taught 
ina single daily forty-minute period. АП inter- 
mediate grade teachers scheduled геа ding for the 
same period. At the beginningofthe reading period 
many children left their regular classrooms and 
went to others in which they hadreading. Each 
teacher taught reading to a group of 25-35 children, 
all of whom were reading onapproximately the 
same grade level. In this way, the range of reading 
ability of every class was reduced far below that 
which existed in the regular class. 

In the larger schoolinvolved in the study, there 
were five teachers in the intermediate grades, and 
it was thus possible to have reading groups on five 
different levels. In the other school, it was pos- 
Sible to have only three groups. 

In early October of 1958 and 1959, the Stanford 
Achievement Test was administered to all children 
in the intermediate grades. Thechildren were then 
divided into reading groups of fairly homogeneous 
nature as far as total reading ability was concerned. 
In the larger school (in which five teachers were in- 
volved in the program), the spread of reading abil- 

ity in each group in the fall of 1959 was as follows: 


Group Grade level range in reading No. of Pupils 


1 2.5 to 3.7 34 
2 3.6 to 4.5 34 
3 4.5 to 5.2 30 
4 5.2 to 7.9 35 
5 7.9 to 9.1 35 


In the smaller school, the spread of reading ability 
in each group was slightly more. 

After a child was assigned toa group, he contin- 
ued to have reading with thegroup unless the teach- 
er thought that he should be moved to another. (Less 
than two percent of the children cha nged groups 
during the year. ) 

Materials for teaching rea ding were so chosen 
that they were suited in difficulty to the reading a- 
bility of the majority of the children in the group. 
The basic readers published by the Ginn Publishing 

Со. were' used as basal texts in the program. Les- 
Sons from the reader occupied three of the five 
reading periods each week. Each teacher reported 
that the teacher's manual accom panying the basic 
reader was used as a guide for instruction. 


During the other two days of the week, attempts 
were made to improve students'skills in reading 
comprehension, vocabulary, and dictionary use. 
The following materials were used for these pur- 
poses; Reading for Meaning (2), Standard Test Les- 
Sons in Reading (5), and Adventures in Dictionary 
Land (4). In some classes, work inthese materials 
was individualized; in others, the lessons were to- 
tal group activities. 

Leisure time reading was encouraged by making 
available a supply of trade books. Twice a month, 
the city library delivered books to each school, so 
that pupils might check them out. Only two ofthe 
eight teachers reported that they used any reading 
class time for free reading, but six of the eight had 
at least one forty- minute period of free reading per 
week outside of reading class. Five of the eight 
teachers required a book report of each pupil each 
six-weeks grading period. One teacher required 
one book report of each pupil during the year. Two 
required no book report. 

Five of the eight teachers involvedinthe program 
had received regular elementary teacher education; 
one of these had a master's degree in that field. 
Three of the teachers had been trained for second- 
ary school teaching and were in the process of tak- 
ing college work to become permanently certified 
elementary teachers. Six of the eight had taken one 
or more courses in teaching reading in the elemen- 
tary school. Two hadreceived no specialized train- 
ing of this type. Teaching experience of the group 
ranged from one year to over twenty; the median 
number of years of teaching experience was nine. 

The children in the program were fairly hetero- 
geneous in socio-economic level and family back- 
ground. Although one of the schools was in an older 
Section of the city occupied chiefly by people of low 
income, only about thirty percent of the children in 
the study were enrolled in this school. The other 
School was in a neighborhood in which middle and 
upper income families live. The intellectual level 
of the children was as follows: 


Grade Mean 1. 9. 2 
4 105 
5 104.5 
6 108 


The pupils іп the study had beentaught reading in 
the primary grades in afair ly typical program. 
The core of instruction in eac h grade had been а 
basic reader. Intra-class grouping was used to 
care for individual differences. Тһе basic readers 
had been Supplemented by using theHay- Wingo 
Phonics materials published by the McCormick- 
Mathers Company. In the fourth grade, allpupils 
had used the special phonics materials. In the fifth 
and sixth grades approximately 50% had used such 
materials, 

Parents were informed of the nature of the pro- 
gram after it had been initiated. This was done 


TABLE I 


TEST RESULTS OBTAINED AT BEGINNING 
AND AT END OF THE PROGRAM 


Grade Initial Status Final Status 
Composition Vocabulary Total Composition Vocabulary Total 
*4 *4. 1 *4.4 *4, 55 *5.5 *5.0 #5525 
5 4,20 6. 46 
6 4.8 5.0 4.90 1.5 7.0 7.25 


ж Grade four was involved in the program only one school year; Grades five and six * 
were involved for two school years. 


TABLE П 
A COMPARISON OF READING GAINS AND FINAL STATUS WITH EXPECTED ACHIEVEMENT 


Grade Gain Expected Actual Gains Expected Actual Final Status 
Comp. Vocab. Total Final Status Comp. Vocab. Total 
4 *0.84 0.83 0.60 0. 72 5.5 5.5 5.0 5.25 
5 1.88 2.26 6.5 6.10 
6 1.94 2.7 2.00 2.35 155 25 7.0 7.25 


ж 


Since the fourth grade was involved in the program for only one school year, the expected gain i 
than that for the other two grades. р gain is less 


AHSWVH 


696 


570 THE JOURNAL OF EDUCATIONAL RESEARCH 


chiefly through a series of meetings by the Parent- 
Teacher Association, іп whichteachers and parents 
reviewed the program of groupingby means of pan- 
el discussions. 

The use of existing achievement test results to 
determine each pupil's gain in reading during the 
program was the cause of one limita tion of study. 
Since the fifth grade teachers involved in the pro- 
gram did not all recordachievementtest results for 
both comprehension and vocabulary, it was impos- 
sible to determine pupil growth in either of these 
two areas. Test results for grade five are expres- 
Sed in terms of total reading achievementonly. The 
test results obtained at the beginning and at the end 
of the program are shown in Table I. 

In an attempt to answer the second and third 
questions originally posed (c oncerning a compari- 
son of gains and final status with expected gains and 
expected final status), the expected gain and final 
reading expectancy were determined for each stu- 
dent. The expected gain was determined by multi- 
plying each pupil's intelligence quotient by the 
length of time he was involved in the program. 
Final reading expectancy was obtained by multiply- 

ing each pupil's intelligence quotient by the number 
of years he had been in school and adding one year. 
(The latter formula is one suggested by Bond and 
Tinker? for use in finding reading expectancy.) The 
expected gains and the final rea ding expectancy of 
each pupil were then compared with his gains and 
final status, The mean for each grade was deter- 
mined. The results are shown in Table II. 

In determining the program's effecton the read- 
ing achievement of students of highand low learning 
ability, the test results wereanalyzedinthe follow- 
ing way: The 219 pupils were ranked according to 
1.0. and the reading gains of the upper and lower 
thirds of the distribution were determined. These 
were compared with expected gains. The results 
are given in Tables III and IV. 

The questionnaires completed by the eight teach- 
ers in the program were examined to determine 
their feelings concerning the plan of grouping. In 
general, teacher opinion was veryfavorable toward 
the method of grouping and instruction. They em- 
phatically stressed its advantages over the former 
method used in their classrooms, which was one in- 
volving the use of one levelof basic reader for ev- 
егу child іп а classroom. They also liked the free- 
dom from being required to finish a whole book by 
the end of the year. Severalexpressedthe wish for 
a longer reading period. Althoughthe questionnaire 
did not directly ask if the teachers would continue 

with the Joplin Plan of grouping, ifthey had free 
choice, it is the belief of the writer that all of them 
would do so. 

The following quotations from the teachers' writ- 
ten remarks seem to the writer toexpress the feel- 
ings held by most of them: 

‘I believe that this method facilitates making a- 

vailable to each child material which challenges 


him—especially the more advanced pupils. ” 

*qt gives the youngster a better feeling of secur- 
ity and it also makes iteasier for the teacher to 
meet the children's needs. ’’ 

“Тһе children are reading material on their own 
level— it is a joy to have them—working together— 
knowing that the material is not boring some be- 
cause it is too easy or leaving some ina ‘fog’ be- 
cause it is too difficult. '' 

There were a few statements that indicated feel- 
ings of reservation. Some of these were as follows: 

‘‘This program сап do no more thanany other 
good reading program. ”’ 

“J feel that we need to include more individual- 
ized reading in our classes. ’’ 

‘Our program is not the ultimate solution to the 
problems of teaching reading— for our retarded 
readers we seem to have accomplished little more, 
as yet, than in a regular reading program. ’' 

Responses to the pupil questionnaire were tabu- 
lated and the general reactions to certain questions 
were noted. In response to two items designedto 
determine which subjects were the most liked and 
least liked, nineteen percent indicated that reading 
was one of the two subjects liked most. (No one 
subject was listed as a favorite by a majority of the 
children.) Less than four percent listed reading as 
one of the two subjects liked least. Over half of the 
latter were pupils reading above grade level. 

In response to the question, ‘Which of the fol- 
lowing do you like to do best in your spare time 2” 
the proportion who checked each response was as 
follows: 


Watch television 18. 1% 
Play baseball or basketball 35. 4% 
Play something else 12.7% 
Read a book 33. 8% 


Only six percent of the pupils answered “Үев”” to 
the question, ‘‘Do you think you would learn to read 
better if you were in a different reading group ?”’ 
However, in response to the question, *Which read- 
ing group would you like to be in?"' twelve percent 
indicated a group other thanthe one they were in for 
reading. Two thirds of those who wanted to change 
groups wanted to move to a higher group. The oth- 
er third espressed a preference to move to a lower 
group. Over half of the latter were inthe top read- 
ing group. 

Fourteen percent of the childrenanswered “Үев” 
in response to the question, ‘‘Would your mother or 
father like for you to be in a different reading 
group?" Twenty percent answered “Мо” to the 
Same question. The remainder checked, ‘I don't 
know’? as a response. 

In two classes, one fifth grade and one sixth 
grade, an open-end question was included in the 
questionnaire. This question read, *What is your 
opinion of the way you are being taught to read? Do 
you like it or dislike it? Why?’’ Of the fifty-two 


- 
eee Н т aa 


TABLE Ш 


READING GROW TH OF THE UPPER THIRD (I. Q. 110-136) 


_————————====—====—===—=————————————— 


Grade N Gain Expected Gain in Comp. Gain in Vocab. Gain in Total 
I 
4 26 85 1.15 67 91 
5 26 2.02 * * 2.69 
6 22 2.01 2.79 1.97 2.01 


* Reading gains for fifth grade available only for total reading growth. 


TABLE IV 


READING GROWTH OF THE LOWER THIRD (І. ©. 77-100) 


нн FE eee 


Grade N Gain Expected Gain in Comp. Gain in Vocab Gain in Total 


4 21 10 .65 


50 .58 
5 27 1.63 * * 1.89 
6 25 1.58 1.54 1.17 1.38 


* Not available. 
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children in these two classes, only five answered 
that they didn't like the procedure. Some of the 
Statements of those who expressed a like or dislike 
are given below: 

“I like it. Make it longer. '' 

“I like it very much because Idon't have to read 
like a fifth grader but the wayI want to. "(From a 
fifth grade girl reading in a seventh g ra de reading 
class.) 

“I don’t like it. I want more to do." (From а 
fifth grade boy reading at sixth grade level. ) 

“I like it because it is different than a reading 
circle, ” 

“1 have learned more this year than any other 
year. "' 

“I don't like it because you have to go from опе 
room to another. '' 

“It’s the best anyone could ask for. '' 

“Т don't like it because the work is too hard. " 
(From a sixth grader of average intelligence who 
was reading on a fifth grade level. ) 

“1 like it better than when we had to getin a 
group and read. ” (From a sixth grade girl reading 
on the fourth grade level. ) 

“I think it is very well because the people who 
are slower are not in a class they can't understand." 
(From a sixth grade boy of low1.Q. who was reading 
ina third grade class. ) 


Conclusions 


The program of cross-grade grouping appeared 
to be effective in producing expected reading gains 
for all three grade levels, when each group was 
considered as a whole. For those who were in the 
upper third of the classes in intelligence, it was ef- 
fective in producing gains equal to or greater than 
expected, except for the fourth grade in vocabulary. 
For those children who were in the lower third in 
intelligence, it was not effective in producing gains 
ав great as expected, except in the fifth grade. 

Teachers generally looked withfavor on the pro- 
gram and believed it was effective in caring for in- 
dividual differences in reading, except possibly for 
those who were retarded in reading (one teacher ex- 
pressed this reservation. ) 

Children had few objections to it; many liked it 
very much. Reading seemed to be held in fairly 
high esteem as evidenced inthe selection of reading 
as a favorite subject by one student in every five, 

and the designation of reading as a favorite leisure 
time activity by one third of them. Only asmall 
proportion indicated that their parents thought they 
should be in a different group. This is interpreted 
as a sign of a lack of parental objection to the pro- 
gram. 


It is recognized that in the instances in which 
gains exceeded expectation, the excess of gains over 
expectancy could not be attributed totally to the 
grouping plan used. It is further recognized that 
there were several variables in the situation that 
could not be measured or their exact influence de- 
termined. This study is reported inorder that it 
may become part of the accumulating body of know- 
ledge concerning the effectiveness of various types 
of reading programs. Its limitations shouldbe tak- 
en into account when any attempt is made to gener- 
alize from it or to apply the plan of grouping to oth- 
er educational situations. Study and evaluation of 
the Joplin Plan of grouping in the Logansport 
Schools will continue. The findings will help to 
substantiate or refute the present tentative conclu- 
sions. 


FOOTNOTES 


1. The writer is indebted to Mr. C. G. Smith, 
principal, and the teachers and pupils of the 
two schools for their cooperation during the 
study, 

2. The intelligence quotients were obtained by ad- 
ministering the Otis Beta Intelligence Test 
just prior to the inauguration of the Joplin 
Plan of grouping. 

3. Bond, Guy L. and Tinker, Miles, Reading Dif- 
ficulties: Their Diagnosis and Correction. 
New York: Appleton-Century-Crofts, Inc., 
1957. 
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The Relative Efficiency of the Various Approaches 


to Writing with the Left Hand 


A PREVIOUS report, taken from an unpublished 
dissertation by the writer} covered the extent of 
the use of the left hand in handwriting. It was found 
that 11 percent of pupils in grades one through six 
preferred the left hand. Among boys, 12.5 percent 
used the left hand, and among girls, 9.7 perce nt. 
This second report will deal with the re lative ef- 
ficiency of the various approachesto writing with 
the left hand. 


The Literature 


In a study of 108 references dating from 1847 to 
the present, the writer found no mention of left- 
handedness until the year 1915. These references 
included (1) handwriting instruction books for pupils 
and adults, (2) teachers’ manuals in handwriting, 
(3) books devoted exclusively to handwriting, and 
(4) courses of study dealing with handwriting. 

From the first mentioning of the problem in 19152 
there has been a gradual increase in reference to 
left-handedness. The literature, to the present, 
revealed six different approaches to the problem. 
The suggested slope of letters variedfrom the more 
easily read forward slant, to verticle, and to ques- 
tionable backhand. Yet, with all of the disagree- 
ment and dearth of understanding, there were no 
research attempts to findbe tter answers to the 
problem confronting the teacher of this segment of 
the school population. West3 even recommended 
such a study as early as 1927, but the sugges tion 


went unheeded. 


Techniques Actually Used by Left-Handed Writers 

The writer began his search for methods actually 
used by children in classrooms with the aid of a 
camera and sketch pad. Careful notes were taken 
of likenesses and differences among the techniques 
employed by thesewriters. Grades five through 


eight were selected for this phase of the study. It 
was felt that in these more advanced grades, habits 
hniques of writing 


would be well set, resulting in tec 
that were established sufficiently to permit compar- 


*Footnotes will be found at the end of the article. 


E. A. ENSTROM 
Greensburg, Pennsylvania * 


ative evaluation. 

The two-year study of this phase of the problem 
revealed that more classifiable methods and varia- 
tions were in use than were listed in the literature. 
Instead of six approaches, the writer found 15 dif- 
ferent techniques that could be classified and that 
were used frequently enough to be tested for relative 
efficiency. 

The classroom observer will discover that there 
are two general groups attempting to write with the 
left hand: those who keep the writing hand below the 
writing line (Group І inthe writer's study) and those 
who approach from the left side of the paperand 
more or less ‘‘hook’’ the wrist while writing (Group 
П). A careful look will revealmany differences 
among pupils found in each group. 

In Group 1, there were six different classifiable 
variations taken into account,(1) the relationship be- 
tween the lower edge of the paper and the front edge 
of the desk, (2) the angle formed between the arm 
axis and the paper ruling, and (3) the dire ction of 
the movement which determined leverage (or lack 
thereof) and slope (slant, or lack thereof) in the 
written product. For details of Group I techniques, 
with variations, see the plate entitled ‘‘Group 1.” 

Group II, consisting of the loosely termed‘‘hook- 
егв,” included nine classifiable techniques. Уагі- 
ations were found (1)in the placing of the paper in 
relation to the front edge of the desk (which also af- 
fects body posture), and (2) the extent to which the 
wrist is turned on edge and “flexed” during the 
writing process. Some writers keep the wrist on 
edge sufficiently to facilitate a hinge action at the 
joint as the basic movement supporting the writing. 
Others keep the wrist flat (probably the resultof 
teacher instruction directed to r ight-handed writ- 
ers) that permitted only poor wrist movement. Still 
others may be classified as being approximate ly 
half-way between these twoextremes. Descriptions 
of each of these adjustments are found on the plate 
entitled ‘‘Group П. ” 4 

A comparison of the frequency of the use of tech- 
niques showed that certain adjustments held prefer- 
ence by pupils in both GroupsIand П. In Group І, 
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GROUP I 


W riting-hand-below-the-writing-line adjustments. All these 


adjustments are relatively smear-free. 


I-C 


I-Dj 


1-D2 


Paper is not turned enough to be 
the reverse of right-handed placement. 
Angle between the arm axis and paper 
ruling is less than 909, Writing slope 
varies but is downward (usually very 
irregular). 


Reverse of right-handed position. 
Arm axis is 909 with paper ruling. Down- 
ward strokes are directed toward elbow, 
“into the sleeve.'' Writing slope approaches 
vertical and is usually somewhat irregular. 


Reverse of right-handed position. Arm 
axis is 90? with paper ruling. Slant strokes 
are directed downward and outward (le ft - 
ward). Slant is generally forward and uni- 
form, 


Paper is turned more to the right 
(clockwise) than the reverse of right-handed 
placement. Arm axis angle with paper 
ruling is greater than 909, Slant motion 
is a sideward (leftward) push of the writing 
arm, Slant is uniform ly forward. 


Extreme turning of the paper. Paper 
ruling is 90? (plus or minus 59) with the 
front edge of the desk. Slant motion is a 
Sideward (leftward and upward) push of the 
Writing arm. Slant is generally forward 
and uniform. 


Very extreme turning of the paper. 
Paper ruling is greater than the 909 (above). 
Angle between top of paper and the front 
edge of the desk is usually between 509 and 
309. Slant is a sideward (upward) push of 


the writing arm, Slant is generally forward 
and uniform. 
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Writing-hand-above -the -writing-line adjustments. Inherent 


smearing possibility exists with all these writing techniques. 


А Adjustments numbered 11-А. Paper is turned leftward as for 
right-handed placement. Тһеве-аге the three basic hooked wrist 


techniques. 


ТА: 
1I-Aa ON 
5% 
26 
жел ы 
Wrist is turned on 


edge enough to permit 
maximum flexing. 


Wrist is somewhat 
flattened, Less flexing 
--more finger move- 
ment. 


Wrist is flat, Prac- 
tically no flexing--all 
finger or whole arm 
movement, or mixture 
of two. 


Adjustments numbered II-B. Same as II-A except that the paper 
ruling is generally parallel with the lower edge of the desk (plus or minus 


50); 


Wrist is turned оп 
edge enough to permit 
maximum flexing. 


Wrist is somewhat 
flattened. Less flexing 
--more finger move -~ 
ment. 


Wrist is flat. Prac- 
tically no flexing--all 
finger or whole arm 
movement, or mixture 
of two. 


Adjustments numbered 11-р. Same as II-A but paper is turned 
rightward (clockwise) as for 'correct'' left-handed placement, yet wrist 


is still hooked. 


II-Db Sug II-Dc SON 
М I WAS 
Y > А ҸӘ) 


II-Da 


Wrist is turned on 
edge enough to permit 
maximum flexing. 


Wrist is somewhat 
flattened. Less flexing 
--more finger move- 
ment. 


Wrist is flat, Prac- 
tically no flexing--all 
finger or whole arm 
movement, or mixture 
of two. 
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the most frequently used techn ique, it waslater 
found, held high advantage in both Speed and quality. 
In Group П, a less desirable technique was used 
more frequently. With advancing grades, however, 
it was found that this less desirable method tended 
to convert into the most desirableapproach for this 
often condemned group. 


Evaluation of the Adjustments 


The criteria used as a basis for the evaluation of 
the various techniques employed were (1) the quality 
of the handwriting product, (2) the rate of writing, 
(3) ability to produce neat, smear-free papers, and 
(4) healthful body posture considerations. In addi- 
tion to the study of the quality of papers produced 
and the time required in writing them, notes re- 
garding smearing tendency and possible health haz- 
ards were made during the initial Study and classi- 
fication of all left-handed writers. 

Every precaution was taken in the eva luation 
phase of the work to make sure that each pupil was 
actually using his classified technique. Testing for 
Speed was done by using the best combined recom- 
mendations for such testing. Whole classes were 
tested and left-handed writers were not aware that 
the testing concerned them. Quality Scores were 
obtained through the use of the Ayres Scale ? with 
trained experts rating the scrambled papers five 
Separate times. The average of the five ratings 
was used as the final quality score with Special pre- 
cautions taken for any rarediscrepancies. None of 
the persons Scaling the papers had information as 
to which of the fifteen techn iques was used by the 
Writer of any given page. Two different samples 
Were used in the quality testing: (1) The rate test 
where the pupils were instructed to write **as well 
ав you can, as fast as you can, ’’and (2) an addition- 
al sample where the pupils were merely asked to 
Write as well as they could. Favorable comparison 
here showed that the original instructions had been 
followed. 


Scope of the Study 


At first it was thought that the Study would be 

Sufficient in scope if 400 left-handed writers 
were classified and evaluated for efficiency. How- 
ever, upon breaking down the four grades into a 
possible fifteen different adjustments in each, itwas 
thought advisable to extend the study by at least 200 
more cases, After this had been done, it was then 
further decided that, to remove all possible doubt 
regarding findings, 500 additional cas es would be 
Studied, tested, compared, and included. After the 
elimination of any casesthat would possibly inter- 
fere with valid findings (pupils using two different 
techniques, handicapped pupils unable to follow test- 
ing instructions, etc. ), 1,103 usable lef t-handed 
writers were included inthe final evaluation. This 
proved more than ample in Securing the desired in- 
formation. 


It should also be mentioned that the findings were 
obtained from schools where handwriting is general- 
ly well taught through a program of in-service help 
consisting of workshops, class visitations and dem- 
onstrations by resource personnel available at 
Scheduled intervals during the school term.Ó The 
chief objective of the study was to find the efficiency 
potential of each adjustment in use. 


Findings: Relative Effic iency of Various Tec hniques 


7 In Group I, the I-E adjustment (not taught in the 
area prior to the testing, but discovered by pupils 
in the group) also rated highest inquality and in rate 
of all six Group I techniques. In each grade, pupils 
using this approach ranked much higher than the 
quality norms set for the grade. Thesame was true 
for speed scores in all grades except eighth, where 
the rate finding was thesame as the norm. (The 
exceedingly high quality here apparently slowed the 
Speed to some extent. ) 

The I-D; adjustment (taught in the area) was next 
in frequency of use, was above grade standard in 
quality, but below rate standards in grades five and 
Seven, almost at the rate standard in grade six, and 
above in grade eight. Thegeneral picture here, 
while good, was not as favorable as in 1-Е. 

The I-F technique (not taught, but discovered by 
pupils) fluctuated more in findings than the two pre- 
vious adjustments. It was above quality standards 
in grades five, six and Seven; below ineight. It was 
below rate standards in grades five and six; above 
in seven and eight. 

The frequently recommended (in the litera ture) 
I-D, technique (pulling the arm “іпіо the sleeve’) 
was at quality standard ingrade five, but well below 
standard in grades Six, seven, and eight. In rate, 
grade five was slightly below s tandard, grade six 
further below, but grade sevenand eight were above. 
This speed, however, resulted in the previously 
mentioned very low quality scores for these two 
grades. In spite of its frequent recommendation, 
there is nothing to suggest that this adjustment 
Should be taught to left-handed pupils as a means of 
Solving their writing problems. 

The other two adjustments in Group I (I-C and I- 
G) were so poor (or fluctuated so much) as to war- 
rant no consideration at all. 

In Group II, only one approach can be recom- 
mended, and then with some reservations. This 
more desirable “һоокей” technique was used by 20 
percent of pupils in this class ification. This ap- 
proach is П-Аа, and showed very high quality with 
above standard rate in all grades except grade five. 
However, there are two inherent drawbacks to this 
Solution: (1) The side of the hand does not move 
readily across the page, particularly if the hand 
tends to perspire. (2) Since the hand, vigor ously 
flexing, moves through the work written, there is 
always danger of Smearing--again, more so if the 
hand perspires. All other Group II approaches were 
either low in speed and quality, fluctuated greatly, 
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or presented possible health hazards (stooped shoul- 
ders, twisted backs, etc.), so cannot be considered 
desirable. 


Conclusions: Adjustments Holding the Greatest 


Promise for Success 


In Group I, half of the six adjustments can be 
considered generally good infour efficiency consid- 
erations: (1) quality, (2) rate, (3) freedom from 
smearing, and (4) posture. Of the remaining, one 


adjustment (I-G) is erratic because of too few cases; - 


another, the frequently recommendedI-D, approach 
is very low in quality, and still another, the I-C, is 
very low in both quality and rate. 

The best Group I adjustments, in order of desir- 
ability, are: 


IE XN IDA d 
S 2237: 
LE Ў [ү ть ЭХ, ің 


These three success-producing techniques make up 
approximately 69 percent of Group I writers. 

In Group II, only one adjustment out of nine can 
be considered good, and then with reservations. It 


is: 
Tho 
P ЖЫ 


While this adjustment, used by only 20 percent of 
the group, ranks high in both quality and rate, in- 
herent weaknesses exist and, under less than favor- 
able conditions, can be very handicapping. These 
weaknesses, as mentioned, are the difficulty of 
sliding the side of the hand forward in carrying the 
writing across the page, and smearing tendencies 
under conditions that cause hand perspiration. 

The remaining 80 percent of Group П writers use 


modifications in the adjustment that cannot be rec- 
ommended, either from (1) the efficiency standpoint 
in quality, rate, or evengreater tendency tosmear , 
or (2) from physical health considerations. They 
seem to have little to recommend them. 

It can be concluded that Group I techniques are 
best, all things considered, and should be taught at 
beginning stages in writing. Adjustment П-Аа is 
the most desirable of the **hooked"' group and all 
advanced grade hookers who are unable, or do not 
desire,to make major changes should be helped to 
modify their approach to conform most nearly with 
this more desirable technique. 


FOOTNOTES 


ж The Peterson System of Directed Handwriting, 
Greensburg, Pa. 
1. Enstrom, E. А., The extent of the Use of the 


Left Hand in Handwriting and the Determina- 
tion of the Relative Efficiency of the Various 


Hand-Wrist-Arm-Paper Adjustments, unpub- 
lished Ph. D. dissertation: University of Pitts- 


burgh, 1957. Dr. Herbert T. Olander was 
advisor. 


2. Zaner, C. P., Zaner Method Writing, Arm 


Movement, Teacher's Manual No. 4, 5, Co- 
Jumbus, O.: Zaner and Bloser Co., 1915, p. 
14. 

3. West, Paul V., Changing Practice in Handwrit- 
ing Instruction, Bloomington, І11.: Public 
School Publishing Co., 1927, 55-56, 133. 

4. Inaddition, a more detailed description of each 
adjustment is found intheoriginal dissertation 
for those who wish to delve into the problem 
even further. See 1., above: 24-26, 45-60. 


5. Ayres, Leonard P., Measuring Scale for Hand- 
writing, Gettysburg Edition, Princeton, N.J.: 


Cooperative Test Division of Educatio nal 
Testing Service. 
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Organization and Supervision of Student Teaching in Home 
Economics Education in Off-Campus Centers 


THE PURPOSE OF this study was to discover 
the practices being carried on in training home eco- 
nomics students in off-campus centers in home eco- 
nomics departments in other States, as well as in 
Kansas. Another purpose was to determine ways in 
which a similar program at Kansas State College of 
Pittsburg may be improved in order to carry on a 
more effective program in home economics teacher 
education. 


The Problem 


It was the purpose of this study 1) to determine 
practices which colleges and universities have 
deemed desirable in training student teachers in 
home economics; 2) to obtain recommendations for 
improvement of present Programs in colleges and 
universities reported; 3) to compare practices of 
other colleges; and 4) tomake recommendations for 
improvements at Kansas State College of Pittsburg. 

Letters and questionnaires were sent to45 teach- 
er educators in home economics,educators in state 
Supported colleges and universities. Replies were 
received from 31, or 68 percent of the schools. In- 
formation concerning these schools was gained from 
Mary Irwin's Report on American Universities and 
Colleges, 1956, published by the American Council 
on Education (6). 


Review of Literature 


Literature included in this Study pertained to in- 
terests in teacher education in home economics as 
reported in periodicals and home economics educa- 
tion tests. 

Ivol Spafford (6) reported that as early as 1922, 
students in Alabama Collegelived at Columbiana, a 
small town twenty miles away, for a period of six 
weeks. This was an experience to acquaint the 

home economics student with a11 the activities en- 
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countered in teaching. 


Spafford also reported for the school of home 
economics at Ohio State that ‘‘as early as the fresh- 
man year, students are made aware of the oppor- 
tunity to secure a school, community experience. 
A student undertaking such an activity has both a 
planning and follow-up conference with an advisor?’ 

Florence Stratemeyer and Margaret Lindsey (10) 
reported that cooperating schools are finding ways 
to increase or decrease the length of the student 
teaching period in terms of the needs of individual 
students. 


The Kansas Association for Student Teachi ng 


Newsletter (T) issued at Washburn University, Oc- 


tober, 1958, gave the report of the Committee on 
Qualifications and Selection of Cooperating and Su- 
pervising Teachers as follows: 

“Ав one factor in the impro vement of the pro- 
gram in Teacher Education, the committee recom- 
mends that by the fall of 1964, a11 cooperating and 
Supervising teachers be required to have special 
certificates to be granted to teachers meeting the 
following standards: 

1. Possession of а regular degree certificate. 

2. A minimum of 15 semester hours of recent 
graduate work applicable toward a Master's 
degree (received within three years ofthe 
date of application for certificates). 

3. A minimum of two years of successful teach- 
ing experience. 

4. College credit or certified workshop exper- 
ience in work related to the supervision of 
student teaching. 

5. Recommendation by the chief school admin- 
istrator in the district in which the teacher 
is employed. 

“Тһе term of the certificate wouldbefive years; 

renewal by recommendation of the chief schoo! ad- 
ministrator of the district. ” 


| 


Analysis of Data, Results, and Discussion 


The following practices were reported on sam- 
ples by teacher educators in home economics edu- 
cation in 31 liberal arts colleges and universities 
referred to previously in this study. 


TABLE I 


STUDENT TEACHING IS OFFERED 


Basis Colleges Reported Percent 


Semester basis 18 58 
Quarter basis 13 42 


Teacher educators in home economics education 
reported 58 percent of colleges operating on the 
semester basis and 42 percent operating on the 
quarter basis. 


TABLE II 


TEACHING EXPERIENCE PREVIOUS TO 
PROFESSIONAL SEMESTER 


————————————————————Є—Є——— 


Experiences Colleges Reported Percent 
None 22 71 
Sophomore year 1 3 
Junior Year 5 16 
September experience 2 1 
Only six hours 1 3 


MENT. Башар UEM T puse 


Two teacher educators reported that arrange- 
ments were made for student teachers to spend two 
weeks observing and assisting inhomemaking clas- 
ses in their home towns prior to fall enrollment at 
college. This type of experience was also referred 


to in the review of literature as practiced in Ohio. 
Ninety-seven percent of the teacher educators 


reported that their colleges maintained off-campus 
centers for student teaching. (See Table ш.) Only 
29 percent maintained the laboratory School for the 
entire teaching experience in home econom ics. 
Kansas State College of Pittsburg offers both ex- 
periences. 

As Table IV shows, teacher educators were re- 
ported in the greaternumber, 87 percent, as having 
the responsibility of selecting the off-campus teach- 
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TABLE III 


CENTERS FOR SENIOR TEACHING EXPERIENCE 
———————————Є—————ЄЄ—ЄїүЁ 
Colleges Reported Percent 


Off Campus 30 97 

Public School 18 58 

Laboratory School 9 29 
TABLE IV 


SELECTION FOR TEACHING CENTER MADE 
BY THE FOLLOWING 


Colleges Reported Percent 


State supervisory staff 23 74 
Director of student teaching 17 55 
Home economics teacher ed- 

ucator 27 87 
Dean or chairman of the home 

economics department 11 39 


------------------------------ 


ing center for home economics. The next highest 
in home economics were the state supervisory staff. 
It is recognized in many states that the campus 
teacher educator in home economics and state su- 
pervisory staff cooperate in this area because of the 
close relationship of state departments and reim- 
bursed departments with state colleges. 

Fifty-five percent of the directors of teacher ed- 
ucation selected off-campus centers as reported on 
the samples. It is believed bythe writer that the 
selection should be a shared responsibility, not left 
entirely with one person. It is belie ved also that 
personnel trained in home economics understand the 
varied field better than educators in other areas. 


Summa: 


The four objectives included in the questionnaire 
are stated as follows with tabulated results pertain- 
ing to each. 

1. To determine practices which colleges and 
universities have deemed desirable in training stu- 
dent teachers in home economics. 

Eight colleges were reported as spending no time 
in preparation of student teachers before they left 
the campus. Time spent inoff-campus centers var- 
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ied from four to 18 weeks with nine weeks reported 
more frequently. 

Only three colleges were reported as providing 
no time for evaluation of off-campus experiences on 
the return of the students to the campus. The 
greatest time spent was nine weeks. 

Student teachers were enrolled in student teach- 
ing during the junior year in a large majority, or 
83 percent, of the schools. Enrollment in student 
teaching varied from two to 40 students each se- 
mester. Over three-fourths, or 75 percent, of the 
student teachers observed high school classes dur- 
ing the junior year before student teaching. More 
than twice as many observed in secondary class es 
as in elementary. Observations were made by 79 
percent in three or more classes representing dif- 
ferent age groups. 

Teaching experience before entering the profes- 
sional semester was not considered important. 

Ninety-seven percent of the teacher educators 
reported that their colleges maintained off-campus 
centers for student teaching in home economics. 

Teacher educators in home economics reported 
in the greater number, 87 percent, as having the 
responsibility of selecting off-campus centers. O- 
ver half, or 55 percent, of the directors of student 
teaching served in this capacity. 

Almost 85 percent of the teacher educators re- 
ported that schools for off-campus teaching sup- 
ported the reimbursed program. АП teacher edu- 
caturs reported that the off-campus centers for 
home economics were required to have facilities 
for all instruction areas—foods, clothing and home 
living—as well as adequate teaching materials in 
the department. 

Thirty-five percent of the off-campus supervi- 
sors were required to have master's degrees, 
while 48 percent were expected tobe working toward 
one. 

In 77 percent of the 31 colleges and universities, 
the teacher educator in home economics made the 
arrangements with off-campus centers. 

One-half of the supervisors were considered as 
members of the home edonomics education depart- 
ment at the college or university. 

Sixty percent of the off-campus supervisors re- 
ceived some reimbursement. 

Ninety-four percent of the teacher educators 
promoted good relationships with off-campus school 
administrators, supervising teachers, and student 
teachers. Conferences were held with student 
teachers before they left the campus. Eighty-one 
percent of the teacher educators held conferences 
with supervisors before the student teachers left the 
campus. Almost 85 percent of the teacher educa- 

tors held conferences with supervisors and student 
teachers to evaluate the growth of the student teach- 
ers. 

More than three-fourths oftheteacher educators 
in home economics do not have full responsibility 
of the campus laboratory school. Teacher educa- 
tors completed their load by assuming at least two 


college classes in home ec onomics, in addition to 
methods. Ninety-three percentassisted in advising 
students. Ninety percent car ried on committee 
work. Sixty percent assumed adm inistrative work 
and research. 

Visiting off-campus students for an entire day 
was possible by forty percent of the teacher educa- 
tors. Grades for off-campus teaching were deter- 
mined jointly by 93 percent of the supervisors and 
teacher aducators in home economics education. 

Reimbursement for expenses during off-campus 

visits was received by 90 percent. 
' 2, To obtain recommendations for improvement 
of present programs in colleges and universities as 
reported by teacher educators in home economics 
education. 

Excellent rapport with cooperating schools, co- 
operating teacher, student teacher and teacher ed- 
ucator on the campus was r ec ommended as essen- 
tial for a well-rounded teacher education program 
in home economics education. 

Evaluation of the student teacher's past experi- 
ences should be made carefully and cooperatively 
by the student teacher and teacher education in home 
economics education in order to plan for further ex- 
periences in teaching. 

Participation in the activities of Future Home- 
makers of America should be a part of the total ex- 
perience in student teaching. 

Every student teacher in home economics should 
experience full responsibility of classes toward the 
end of the student teaching period. 

Every student teacher should have experiencein 
helping supervise and evaluate home project s. 

Cooperative evaluation devices should be used 
by student teacher, supervisor, and teacher educa- 
tor in home economics education. 

Prospective student teachers in home economics 
would gain valuable experience in visiting the first 
two weeks of the fall term in the schools in their 
home towns before returning to the campus. 

At least a two-hour period might be planned for 
student teachers to meet with the methods class on 
their return to the campus to evaluate their exper- 
iences and answer questions. 

The teacher education staff on the campus and 
the off-campus supervisors should c onsider plan- 
ning an experimental program to determine 
strengths and weaknesses of the teacher training 
program. 

A follow-up study made withall first-year teach- 

ers should be very valuable in promotinga stronger 
teacher education program in home economics, as 
well as providing a closer relationship between the 
campus and the alumnae of the college. 
_ The subject matter staff would gain much helpful 
information in visiting off-campus centers. Аза 
result, an evaluation of the department program 
might evolve. 

All supervisors might well be invited to depart- 
ment activities on the campus. 

Additional time is needed for three-way confer- 


ences —student teacher, supervisor and teacher ed- 
ucator working together in o rder to strengthen the 
training of student teachers. 

Continuous experiences such as conferences and 
workshops are needed which will help all supervis- 
ors and teacher educators work for the best evalua- 
tion and improvement of the teacher education pro- 
gram. 

Communities that understand the teacher educa- 
tion program in home ec onomics education should 
be elected. 


In-service training for supervising teachers' 


should be planned and conducted by the colleges and 
universities. 

More experience in working with adults will help 
to provide a well-rounded program forstudent 
teachers. 

Student teachers should be involved in at least 
two of the areas of homemaking during the student 
teaching experience. 

Heads of all subject matter areas should meet to 
formulate policies and coor dinate the program for 
training teachers. 

All off-campus supervisors should meet on the 
campus during the student teaching period to evalu- 
ate the total teacher training program; from this 
should evolve a long-time plan for improvement. 

The teacher education program in home econ- 
omics may be strengthened by using the same su- 
pervisors each year when possible. 

A comprehensive study 5 hould be made of the 
background of student teachers, including work with 
children and groups to determine help needed. 

More emphasis should be placed on cooperative 
planning and evaluation of the total student teaching 
experience. This should be a continuous experi- 
ence. 

Implications for Improving Teacher Education Pro- 
grams in Home Economics 

Evaluate the practices given in this study which 
signify the greatest import in training teachers in 
home economics education. 

Evaluate the strengths and weaknesses of the 
program concerned as compared with the findings of 
this study. 

Determine ways in which the teaching load of the 
teacher educator in home economics inthe program 
concerned may be improved to perm it moretime 
for supervision of student teachers on and off eam- 

us. 
j Employ а supervisor for the high school labora- 
tory to carry on full responsibility of classes, Fu- 
ture Homemakers, home visits, home project s and 
other extra-class activities. 

Plan the schedule for the teacher educator to al- 
low time for conferences with the off-campus super- 
visor. 

Plan the teaching load of the teacher educator to 
include subject matter courses and other responsi- 
bilities as reported in this study by 94 percent. 

Delegate to the teacher educator the responsibil - 
ity of selecting the off-campus center, a responsi- 
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bility of 87 percent reporting in this study. 

Include the director of teacher educator, the 
teacher educator in home economics and state 
homemaking supervisory staff in selecting the off- 
campus center. 

Plan in-service training in supervision for off- 
campus supervisors. 

Plan reimbursement to teacher educator for ex- 
penses incurred while visiting off-campus school as 
reported by 90 percent of the teacher educators. 

Plan meetings on the campus for home economics 
supervisors, student teachers, and the teacher ed- 
ucator in home economics education. 

Plan the schedule for the teacher educator in 
home economics to include time for visiting alumnae 
teaching their first year. 

Plan continuous studies in the home economics 
education department which will determine weak- 
nesses in the present program and help to strength- 
en the home economics curriculum. 

Develop plans for recognition to be given to off- 
campus supervisors, including status and financial 
reimbursement. 

Encourage supervisors who do not have a mas- 
ter's degree to be working toward one. 

Require in the near future that all supervisors 
have a course in supervision. 
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A New Method for Evaluating the Effectiveness of 


Treatment of Learning Difficulties 


FRIEDA LIBAW, FRANCES BERRES, JAMES C. COLEMAN 


ANY PSYCHOLOGY clinic which offers special 
treatment services is faced with the problem of e- 
valuating the effectiveness of these services. The 
staff of the UCLA Psychology Clinic School has felt 
this responsibility keenly. Inthis school, special 
techniques are used in a remedial program designed 
to help pupils of average or superior intelligence 
overcome serious learning difficulties— difficulties 
which have resulted in severe retardation in basic 
School skills. 

Investigators who haveattempted to evaluate the 
success of other programs with similar goals have 
typically used one of three conventional methods of 
appraisal: a) the follow-up type of study in which the 
attempt is made to ascertain pupil competence in a 
regular school placement at some specified time af- 
ter treatment, b) the control-experimental group 
comparison type of study in which the improvement 
made by a treatment group is compared with that of 
a matched group of non-treated subjects, and c) the 
achievement method in which achievement scores 
before treatment are compared with achie ve m ent 
Scores after treatment. 

None of these methods has proven entirely satis- 
factory. The first method suffers from the practi- 
cal difficulty of keeping track of numbers of subjects 
over sustained time periods and of determining the 
influence of experiences intervening be tw een time 
of termination of treatment and time at which fol- 
low-up is undertaken. The second method denies 
needed services to children in order that they may 
Serve as control subjects; the third method does not 
take into account individual differences inthe sever- 
ity of initial disability, e.g., the relative gain made 
by a subject suffering from severe learning disa- 
bility may be much greater than the gain made by a 

less handicapped subject, even though each of them 
makes the same objective gain in achievement test 
Scores. The method proposed in this paper avoids 
these difficulties while at the same time it yields a 
more meaningful measure of individual achievement. 


The Method 


The method itself may be considered to consist 
of six basic steps. These are: 
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1. Obtaining measures of achievement prior to 
treatment. 

2. Computation of the individual's rate of learn- 
ing before the start of treatment. 

3. Extrapolation, on the basis of this computed 
rate, to predict achievementafter a given time in- 
terval. 

4. Obtaining a measure of achievement after 
treatment has been underway for this given interval. 

5. Comparison of the predicted measure with the 
actual measure obtained at a period during or after 
treatment. 

6. Computation of a sign test of the significance 
of the difference between the predicted and the ob- 
tained measures of achievement. 

In the application of the method at the Clinic 
School, these steps take the following form: 

l. The measures ofachievement prior to the on- 
Set of treatment are taken from the scores obtained 
either on the Stanford Achievement Tests огітот 
the California Achievement Tests administered at 
the time the pupil enters the Clinic School. Alter- 
nate forms of a single test are used throughout. 

2. Since selection procedures insure that all 
Clinic School pupils are of average or superior in- 
telligence, the computation of the rate of lear ing 
is based on the expectation that the pupil should have 
made normal school progress and that he should be 
at a grade level commensurate with his chronologi- 
cal age—had it not been for emotional or other fac- 
tors which interfered with normal learning. There- 
fore, the rate of learning is co mputed by dividing 
the pupil’s test Score, expressed as a grade-equiv- 
alent, by the expected grade-place ment for his 
chronological age. (If the method were to be ap- 
plied to retarded or gifted children , mental-age 
grade-equivalents might be used.) For example, 
the normally intelligent child of nine should be at the 
fourth grade level. If his test scores place him at 
Second grade, his rate of learning wouldbe 2 divid- 
ed by 4, giving a rate of learning of 0. 5. 

Ап extrapolation is made on the basis of this 
computed rate to predict achievement after a given 
time interval. The pupil’s expected score after а 
specified time interval is obtained simply by multi- 
plying rate by time. Thus our hypothetical pupil 
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with the 0.5 learning rate would be expected, after 
atwo-year period, to have increased his achieve- 
ment score by one year. He could, in other words, 
be expected to test at the third grade levelevenif 
no special treatment had been instituted. 

4. The actual grade placement at the end of the 
specified time interval is obtained by administering 
an alternate form of the same standardized achieve- 
ment test that had been given before treatment (Cf. 
item 1 under heading B). 

5. The actual scores made by the pupil on these 
tests are then compared with the scores that could 
be expected if the learning rate had remained con- 
stant. A simple subtraction of the predicted score 
from the actual scoregives the amount of gain or 
loss in achievement. If, for example, at the end of 
two years, the student actually scores at the fifth 
grade level, even though at that time he would still 
be below expectedgrade level for his age, he would 
have made a gain of two years beyond the year's 
progress that would have been predictedon the basis 
of his past performance. Moreover, for the two- 
year period of treatment, his rate of learning would 
have been 1.5. The ratio of the rate of learning 
during treatment to the rate of learning prior to 
treatment constitutes a measure of the change of 
rate of learning, presumably contributed by the 
treatment process. In this instance, the ratio of 
rates would be equal of 1.5 over 0.5, indicating that 
the pupil had learned three times as fast during 
treatment as he had prior to its onset. Obviously, 
the ratio may also be expressed asa percentage in- 
crease in rate. Any ratiogreater than 1. 0 indicates 
an increase in rate. Any ratio below 1.0 indicates 
а decrease. We thus have two obtainable measures 
for the appraisal of the individual pupil’s progress. 
First is the gain (or loss) in grade - equivale nts 
from predicted to actual measures of achievement. 
Second is the measure of rate of increase or rate 
of decrease in learning. 

6. The sixth step, the computation ofa sign test 
for determining the significance of differences of 
predicted to actual achievement, provides a simple 
and convenient means for appraising the effective- 
ness of the remedial program as a whole rather 
than its effect on the individual. 


Discussion 


The present method permits the assessment of 
an individual pupil's progress by giving а compari- 
son of his present to his past rate of learning. 
Since increase in rate of learning is essential ifthe 
individual is to overcome his retardation, this com- 
parison provides a meaningful picture of the stu- 
dent's response to treatment a nd of the specific 
progress which he, as an individual, is making. 
It also provides a basis for prognosis in relation to 
the probable amount and duration of treatment that 


will be required. ү 
The use of the sign test makes it possible to de- 


termine whether an entire group’s рег formance is 
significantly better after a specified treatment in- 
terval. While it is less powerful than parametric 
tests, it is considered more valid ina clinic setting 
because it does not rest onanyassumptions of nor- 
mality of distribution or homogeneity of variance in 
the population, nor does itrequire that treatment be 
the same for all members of a group— all of which 
are required for the legitimate use of t-tests and F 
-ratios (1, 2). Thesigntest only requires that both 
members of a single pair, each assigned to one of 
the two groups, be directly comparable. Since the 
proposed method directly compares the scores of 
the same pupil before and after treatment, each pu- 
pil effectively serves as his own control. The 
matching pairs in eachgroup is maximally accurate 
for such usually uncontrolled variables as motiva- 
tion and anxiety, and any distortion due to uncon- 
trolled differential treatment of matched pairs is 
eliminated. 

The proposed method, however, rests ontwo as- 
sumptions which should be explicitly examined. The 
first of these is that the learning has proceeded at 
a uniform rate prior to the onset of treatment, 
Most psychologists would question this assumption, 
pointing to the evidence of spurts and plateaus in 
learning curves. However, itshould be remember- 
ed that the rate of learning used by the present 
method is computed on the basis of scores made on 
standardized achievement tests and that these 
scores represent the operational definition of learn- 
ing in the present method. Standardization proce- 
dures inwell-constructed tests adj ust grade- level 
equivalents so that a uniform rate of learning equal 
to 1. 0 will be obtained empirically by the majority 
of school pupils. They will make one grade-level 
gain for each year spent in school. Also from an- 
other point of view, the assumption of a uniform 
rate of learning is justified since it represents, in 
effect, a mean of the learning rates over time and 
the mean of past performance would appear to be 
the best single predictive index for performance in 
the future. 

Moreover, there is indication that in pupils with 
learning disabilities who do not receive treatment, 
there tends to be a continual decline in the rate of 
learning when achievement test scores are used as 
the basis for computation. Ifthis is So, and both 
logic and inspection of our data indicate that it is, 
then the assumption of a uniform rate of learning by 
the present method willgive an underestimate of the 
pupil's actual progress during or after treatment. 
If the uniform rate is used in evaluative research, 
it is legitimate for the researcher to bias results 
against his hypothesis and for practical purposes, 
when progress is indicated by the present method, 

it can be viewed as genuine. / 

The second assumption underlying the proposed 
method is not unique to it but is basic to nearly all 
evaluative research which uses achievement test 
Scores as measures of pupil progress. Thereis an 
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implicit assumption that the various levels and al- 
ternate forms of a standardized achievement test 
are equated and that no significant practice effects 
occur in a test-retest program using different forms 
of the same test. The fact is that the correlations 
between scores obtained on various forms are not 
perfect, that practice effects are unknown and un- 
accounted for and that no scientifically based means 
exist for appraising the reliability of a test for an 
individual. There is even less evidence of compar- 
ability among various types of achievem ent tests 
standardized on different populations and using dif- 
ferent types of items to arrive at scores for simi- 
larly labeled achievement categories. Careful con- 
sideration, therefore, should be given to the relia- 
bility and validity of tests of achievement and cau- 
tion should be exercised in drawing conclusions 
from the use of this or any method of evaluation 
which uses test scores as its fundamental data. No 
method can be better than the measures it employs. 
However, all methods of evaluation are dependent 


on some measure of achievement, and the present 
method is under no greater handicap in this respect 
than are other methods which use s tandardizeda- 
chievement tests for assessing improvement. 

The present method recommends itself for use 
not only for evaluating the effectiveness of remedial 
treatment in learning difficulties, but also as a re- 
search tool in relating personality, task, and pro- 
cedure variables. It can also be used as a method 


„for assessing the effectiveness of psychotherapy or 


other procedures which are adjunctive to the reme- 
dial treatment proper. 
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The TMI Self-Tutoring Program in Spelling Compared 
with Teacher and Flash Card Taught Programs * 


JOHN J. BUZBY, Whitpain Elementary School 


LESTER MANN, Montgomery County Public. Schools 


AS PROGRAMMED learning and machine-taught 
materials become increasingly prominent on the 
educational scene, it is important for teachers and 
School administrators to evaluate the effectiveness 
of individual programs in their application to spe- 
сінс classroom populations. The present study 
represents such an attempt. It is a comparison of 
the TMI-Grolier Fundamentals of Spelling Program. 
as presented on the Min/Max Teaching Machine, 
with flash card self-tutoring, and teacher-taught 


The TMI spelling program 18 part of the TMI- 
Grolier Modern English Series (Teaching Machines 
Inc., Albuquerque, 1961) For effective usage of 
the program, а third grade reading know ledge is 
recommended. The program consists of 600 pages 
with 3000 frames. Approximate study time report- 
ed by the catalog is 12 to24 hours. The program 
enphasizes the following: *cVowel-consonant iden- 
tification. Intensive practice on familiar spelling 
rules. Proof reading. Contractions. Homonyms. 
....Basic rules for using the dictionary” (p. 2). 
The program purports to teach the student now to 
spell 400 complete words and “the significant 
parts of hundreds more” (p. 2). 

Previous research or work with the program, 
as reported in the catalog, included work with six 
college freshmen who were poor spellers. Accord- 
ing to their instructor, ‘‘the program did for them 
in an average 12 à hours what 12 years of formal 
training in spelling failed to do” (p. 15). Another 
study involved 23 sixth graders who «improved, on 
the average, over a year in spelling achievement 
after 21 hours with the TMI-Grolier spelling pro- 


gram" (р. 15). 
Procedure 


The TMI-Grolier Fundamentals of Spelling pro- 
gram does not have à separate list of the words be- 
ing taught included with its program. The experi- 


* Appreciation is expressed to the following who а : 1 
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menters combed through the 600 pages of the latter 
to identify them. While the catalog indicates that 
440 words are taught through the program, only 384 
were extricated by the experimenters. These were 
prepared in spelling lists and sets of typed flash 
cards. 

The subject population for the study came from 
fourth grade classes at the Whitpain Ele mentary 
School, an accelerated, an average, and a slow 
learning class. Twelve subjects were s e lected 
from each class, and assignedto the various exper- 
imentalconditions. These were matched by threes 
on the basis of the spelling scores achieved on the 
Iowa Testof Basic Skills. Sex factors could not be 
controlled in light of the small number of subjects 
employed. АП of the latter were at or above a 
third grade reading level. 

Four children from each class were assigned to 
each of the experimental conditions--machine, 
flash cards, and teacher. The machine groups 
consisted of those children being tutored on the TMI 
Spelling program utilizing the Min/Max teaching 
machines. The card groups were com prised of 
those subjects who were given flash cards with the 
words drawn from the TMI spelling program. They 
were required to study thirty words, per experi- 
mental session, writing each word ten times on 
their tablets. These first two groups were assign- 
ed to separate study halls for the experiment, four 
children to each room. The teacher groups con- 
sisted of those children who were taught the list of 
words from the TMI spelling program by their 
teachers. These were taught within their regular 
classroom settings, together with their fellow stu- 
dents, excepting those who were assigned to the ex- 
perimental groups. The study commenced on Jan- 
uary 9, 1961 and was completed on June 2, 1961. 
A half hour a day was allowed for the experiment. 
It is to be noted that this program was provided in 
addition to the regular spelling regimen offered at 
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the fourth grade level. 

The evaluation techniques employed on the pres- 
ent occasion were two-fold. Firstly, the subjects 
were tested both before and following the experiment 
with the list of words drawn directly from the TMI 
spelling program. Secondly, they were compared 
on the spelling section of the Iowa Test of Basic 
Skills. This two-fold approach was indicated in that 
the TMI spelling program purports to teach funda- 
mentals of spelling that are useful in dealing with 
spelling problems above and beyond the actual 
words presented in the program. It was felt that 
these benefits should be reflected in results of the 
Iowa Test rather than in a direct test of the spelling 
words presented in the program. 

Analyses of variance, in treatment x levels de- 
signs, were employed to evaluate the effectiveness 
of each of the approaches to teaching spelling. 


Results 


Tables I and II present the results of this exper- 
iment in terms of the meanachievement for each of 
the twelve subject groups of the experiment. 

Table I presents the mean achievement in re- 
gard to the spelling word gain on the TMI spelling 
program. Table II presents the mean achievement 
in regard to grade level gain, in months on the 
ITBS. 


SPELLING WORD GAINS ON TMI WORD LIST 
MEME ii————————A —— 


TABLE I 


The analyses of variance, for boththe TMI word 
list and the ITBS results, did not yield any F’s of 
statistical significance, nor did they indicate any 
sort of statistical trend. 

In light of the small number of subjects assign- 
ed to each of the groups, there is a question as to 
whether an extension of the studyto larger groups 
of subjects would have resulted in findings favorable 
to one or the other of the techniques employed. Ex- 
amination of Table I indicates thatthe group using 
the flash cards learned more words from the TMI 
program than did those utilizing either machinesor 
those assigned to teachers. Table II indicates that 
the teacher taught program favored overall spelling 
improvement, as measured by the ITBS. 


Summary and Conclusions 


The TMI-Grolier Fundamentals of Spelling pro- 


gram, presented on the Min/Max Teaching Machine | 


was compared to flash card and teacher taught pro- 
grams in respect to the learning of the spelling 
words directly presented in the program and on the 
Iowa Test of Basic Skills. The null-hypothesis was 
sustained in both comparisons. On the basis of the 
present study, the use of the TMI self-tutoring 


program in spelling does not appear to hold any ad- 


vantages over ordinary classroom or flash card 
taught programs. 


Method Accelerated Average Slow АП 
Class Class Class Classes 
ЕЕЕ АВЫ зор, 2 
Machine 43.5 51.2 45.0 48.6 
Cards 39.2 67,5 43.0 57.3 
Теасһег 34.0 65. 2 39.5 46.3 
All Methods 38.9 63.3 42.5 50.7 
TABLE П 


SPELLING GRADE LEVEL GAINS ON ITBS* 


Method Accelerated Average Slow АП 
Class Class Class Classes 
Machine 8.2 6.2 3.0 5.8 
Cards 11.2 10.5 4.2 8.7 
Teacher 15.0 8.2 13.2 12.2 
АП Methods 11.5 9.1 6.8 8.8 


* In Months 


| 
| 


| 


| 
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= Field News 


THE INSTITUTE FOR COMMUNICATIONS RESEARCH of Stanford Uni- 
versity reports that a survey shows that between 20and25 percent of the adults in а 
telecast area are regular viewers of educational television station programs in- 
tended for home viewing. 
ж жж ж 


THE TRIMESTER SYSTEM at the University of Pittsburgh has enabled 
53 students to earn a college degree in two and two-thirds years instead of the us- 
ualíour. The trimester system is an eleven-month academic year divided into 
three terms: each term is equivalent to a semester at another school. Ofthe 24 
women and the 29 men who started in 1959, 33 have been in liberal arts, 15 in ed- 
ucation, four in business administration, and one in industrial engineering. All 
students had high scholastic averages. 

ж жж ж 


INVESTMENT BANKERS ASSOCIATION OF AMERICA reports that only 
68 percent of the school bond issues submitted (68 percent by volume, 71 percent 
by number) last year were approved. Eighty-two percent by volume and 78 per - 
cent by number had been approved the year before; whereas, five years ago 75 per- 
cent by volume and 79 percent by number had been approved. Тһе question is 
raised that perhaps the taxpayer is rebelling against new School bond issues paid 


out of local property taxes. OMEN 


HIGH-SCHOOL STUDENTS ARE IMPROVING according to Lee A. Du- 
Bridge, President of the California Institute of Technology. Ataconference оп the . 
American high-school graduate, held in New York last April, President DuBridge 
pointed out: 

1. The average Caltech freshman was at the 75th percentile on the verb- 
al aptitude test on national norms in 1951; whereas the 1961 average 
Caltech freshman was at the 97th percentile. 

2. The 1951 average Caltech freshman was at the 89th percentile in the 
mathematical aptitude test; whereas the 1961 group was at the 99th 
percentile. 

3. The two groups fell at the 82nd and the 97th percentile, respectively, 
on the physics test. 

4, The two groups fell at the 68th and the 89th percentile, respectively, 


on the English test. 
President DuBridge further said that evidence from various colleges 


supports the conclusion that while not all American high schools rank with the best, 
the best are setting a pace which sets them apart as the best in the world. 
* * * * 


THE NATIONAL SCIENCE FOUNDATION has announced that 55 grants 
totaling over six-hundred-thousand dollars has been made to 38 state academies of 


i instituti ivities for the improvement of 
science and comparable institutions to support activi pro 
science education during the academic year 1962-63. These programs will operate 


in 37 states, the District of Columbia, and Puerto Rico. 
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TWO RESEARCH FELLOWSHIPS IN PSYCHOMETRICS will be offered 
by the Educational Testing Service for 1963-64. These fellowships, which can 
lead to the PhD degree, are open to men who are acceptabletothe Graduate School 
of Princeton University. Each fellowship will carry a stipend of $3950 per year 
and normally is renewable. Fellows will be engaged in part-time research in the 
general area of psychological measurements at the offices of the Educational Test- 
ing Service and will, in addition, carry anormal program of study in the Graduate 
School. 

The closing date for completing applications is January 4, 1963. Infor- 
mation and application blanks will be available about September 15and may be ob- 
tained from: Director of Psychometric Fellowship Program, Educational Testing 
Service, Princeton, New Jersey. А 

* k ж ж* 


AN EXPERIMENT TO TEACH UNDERPRIVILEGED CHILDREN TO 
READ AT THE AGE OF FOUR is being conducted by Professor Fred L. Strodtbeck, 
Director of the Social Psychology Laboratory of the University of Chicago. The 
purpose of this experiment is to determine whether pre-school reading experience 
will enable the underprivileged child to overcome his disadvantage and thereby be 
less likely to fail in school. It is hoped that success in school might minimize the 
the possibility of future delinquency. 


* * * * 


THE STUDY OF STATISTICS should be undertaken relatively early in 
professional work of education students to allow for more benefits from reading 
professional journals. This is the recommendation of a study recently completed 
by Laurence H. Harshbarger at the School of Education of the University of Penn- 
sylvania. Dr. Harshbarger reviewed seven education periodicals for the period 
1945 to 1955 to determine the extensiveness with which articles were saturated 
with statistical terms and concepts. The findingsofthis investigation seem to war- 
rant the following conclusions: 

1. There was an increase both in the percent of articles containing sta- 
tistical terms and іп the percent of articles making either brief or ex- 
tensive use of statistical terms beyond the level of simple descrip- 
tive statistics. 

2. Nearly ten percent of the articles in the total sample were considered 
to be well saturated with statistical terms beyond the level of simple 
descriptive statistics. 

3. Certain comparatively new terms relating to statistical inference ap- 
peared to be increasing in use. 

4. Terms which fall under the general heading of probability, sampling 
distributions, t, F, and chi Square are found to have very much in- 
creased usage. 


* жж ox 


Readers are invited to send items of research interest to 


John Schmid, Professor, College of Education, University 
of Arkansas, Fayetteville, Arkansas. 
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